Sr. Platform Engineer, L3 in United States at Jobgether
Job Description
This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Sr. Platform Engineer, L3 based in the United States.
This is a high-impact Senior Platform Engineer role focused on building and operating the cloud and infrastructure backbone of a fast-scaling AI-driven SaaS platform. You will work at the core of systems that power production environments, ensuring they are scalable, observable, secure, and highly reliable. The role spans cloud infrastructure design in AWS, Kubernetes operations, infrastructure as code, CI/CD pipelines, and modern observability practices. You will collaborate closely with software engineering teams to improve developer experience, deployment velocity, and system resilience. This is a hands-on engineering position where you will contribute directly to architecture decisions while also driving implementation and operational excellence. The environment is fast-moving, collaborative, and iterative, requiring strong technical judgment and adaptability as priorities evolve. Your work will directly support the stability and scalability of platforms used by enterprise customers in advanced scientific and industrial domains.
Responsibilities
- Design, build, and maintain scalable, secure, and reliable cloud infrastructure in AWS, ensuring strong operational performance and automation across systems
- Develop and manage Infrastructure as Code solutions using tools such as Terraform and CloudFormation to support repeatable and version-controlled deployments
- Deploy, operate, and optimize Kubernetes clusters in production environments, ensuring high availability and efficient workload orchestration
- Build and maintain CI/CD pipelines using tools such as GitHub Actions, with potential exposure to Jenkins or ArgoCD for deployment automation
- Implement and improve observability systems, including monitoring, logging, alerting, and incident response practices (e.g., Datadog or similar tools)
- Support containerized application workflows, including image build pipelines, optimization, and deployment strategies
- Collaborate with engineering teams to troubleshoot infrastructure issues, perform root-cause analysis, and drive long-term system improvements
- Participate in architecture discussions, technical planning, and ongoing platform evolution initiatives to improve reliability and developer experience
Requirements
- Strong experience in infrastructure engineering, platform engineering, DevOps, or site reliability engineering roles
- Hands-on expertise with AWS production environments, including infrastructure design and operational management
- Advanced proficiency with Infrastructure as Code tools, particularly Terraform, with practical production-level usage
- Solid experience managing Kubernetes clusters in production, including deployment, configuration, and ongoing maintenance
- Demonstrated ability to design and operate CI/CD pipelines, especially using GitHub Actions
- Experience implementing observability and monitoring solutions such as Datadog, including metrics, logging, and alerting frameworks
- Strong understanding of containerization workflows, including image optimization and efficient build strategies
- Ability to operate effectively in evolving environments where priorities shift and ambiguity is common
- Strong collaboration and communication skills, with a pragmatic, iterative approach to problem-solving
- Experience in startup or high-growth environments and exposure to platform engineering practices are highly valued
Benefits
- Competitive compensation aligned with senior-level platform engineering roles in the United States
- Equity participation in a fast-growing AI-driven technology company
- Fully covered health, dental, and vision insurance for employees, with partial coverage for dependents
- 401(k) retirement plan with company matching contributions
- Flexible PTO policy plus paid company holidays, including personal milestone days
- Paid parental leave to support family growth and work-life balance
- Annual learning and development budget for professional growth and education
- Technology stipend and monthly phone reimbursement
- Financial wellness support and additional employee assistance resources