Director of Cloud Operations at Jobgether – United States
Explore Related Opportunities
About This Position
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Director of Cloud Operations in United States.
This leadership role is responsible for ensuring the reliability, scalability, and operational excellence of a globally distributed SaaS cloud platform that supports millions of daily users across major enterprise customers. The Director of Cloud Operations will lead a distributed engineering team spanning the US and UK, driving the evolution of cloud infrastructure, observability, and incident management practices. This role blends deep technical expertise with hands-on leadership, ensuring systems are not only stable and performant but continuously improving through automation and modern cloud-native practices. You will collaborate closely with Engineering, Security, and Product teams to strengthen system resilience and operational maturity across multi-region AWS environments. A key focus of the role is balancing innovation with stability, particularly in hybrid environments that include both modern cloud systems and legacy infrastructure. The environment is fast-paced, collaborative, and mission-driven, with a strong emphasis on accountability, reliability, and continuous improvement.
- Own the availability, performance, scalability, and resilience of a multi-region AWS cloud platform supporting large-scale SaaS services.
- Define and drive reliability engineering practices, including SLIs/SLOs, error budgets, and proactive system improvement initiatives.
- Lead incident management processes, including on-call rotations, escalation workflows, and post-incident reviews to reduce MTTR and improve system recovery.
- Oversee architecture and operational strategy for microservices, Kubernetes (EKS), and serverless workloads to ensure scalability and fault tolerance.
- Advance observability practices using modern monitoring tools to deliver actionable insights across infrastructure and application layers.
- Drive operational efficiency through automation, CI/CD optimization, infrastructure-as-code practices, and AI-assisted operational workflows.
- Lead cost optimization efforts across cloud environments while maintaining performance, reliability, and security standards.
- Manage and develop a distributed CloudOps engineering team, fostering accountability, technical excellence, and continuous learning.
- Ensure stable operations of hybrid environments, including legacy systems hosted in private data centers alongside modern cloud infrastructure.
- 10+ years of experience in cloud infrastructure, DevOps, or Site Reliability Engineering, including leadership of CloudOps or SRE teams.
- Proven experience operating and scaling multi-region, customer-facing SaaS platforms in high-availability environments.
- Strong hands-on expertise with AWS, Kubernetes (EKS), Terraform, CI/CD pipelines, and modern cloud-native architectures.
- Deep understanding of distributed systems, microservices architecture, and reliability engineering principles.
- Experience with observability platforms and incident management practices, including on-call operations and production support.
- Strong knowledge of SLO/SLI frameworks, system performance tuning, and operational best practices.
- Demonstrated ability to lead hybrid teams while balancing strategic leadership with hands-on technical contribution.
- Excellent collaboration and communication skills with the ability to influence across engineering, product, and security teams.
- Pragmatic, outcomes-driven leadership style with a focus on continuous improvement and measurable impact.
- Competitive base salary range: $200,000 – $228,000
- Remote-first work model with flexible working arrangements
- Comprehensive health, dental, and vision insurance coverage
- Generous paid time off program and paid holidays
- Inclusive parental leave policies
- 401(k) retirement plan and financial wellness support
- Home office support and equipment provision
- Learning, growth, and professional development opportunities
- Collaborative and inclusive culture focused on innovation and impact.