Senior DevOps Engineer in United States at Jobgether

Job Function: Engineering
Jobgether
United States, United States

Job Description

Senior DevOps Engineer

This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Senior DevOps Engineer based in the United States.

This role sits at the heart of platform reliability and delivery, ensuring that engineering teams can ship safely, quickly, and at scale across complex cloud-native environments. You will be responsible for operating and improving Kubernetes-based infrastructure, CI/CD pipelines, and observability systems that power mission-critical applications. The environment is highly hands-on, combining incident response, automation, and continuous improvement across deployment and runtime systems. You will work closely with engineering teams to strengthen release processes, reduce operational friction, and improve system resilience. This position requires a strong DevOps mindset with deep technical fluency across cloud, automation, and monitoring tools. It is ideal for someone who thrives in fast-moving environments where reliability and efficiency are equally important.

Accountabilities:

You will be responsible for ensuring the stability, scalability, and efficiency of platform operations while enabling engineering teams to deliver software reliably and safely. This includes:

  • Operating and improving platform tooling to support reliable software delivery, including ticket triage, issue resolution, and service request handling
  • Maintaining and evolving self-service workflows, including documentation, templates, and deployment guardrails
  • Managing Kubernetes environments, including Helm deployments, namespace management, rollout troubleshooting, and incident response support
  • Supporting and enhancing CI/CD pipelines (primarily GitLab CI), including job configuration, deployment strategies, and quality gates
  • Monitoring and improving observability systems using tools such as Prometheus, Alertmanager, Thanos, and OpenTelemetry
  • Maintaining dashboards, alerts, and SLO/SLA indicators while reducing noise and improving signal quality
  • Supporting service instrumentation across metrics, logs, and traces using OpenTelemetry
  • Participating in on-call rotations, incident response, and post-incident documentation and improvements
  • Driving automation and cost optimization efforts, including resource right-sizing and operational efficiency improvements
  • Contributing to documentation, runbooks, onboarding guides, and operational playbooks

Requirements:

The ideal candidate is an experienced DevOps or SRE professional with strong automation skills, deep cloud-native expertise, and a focus on operational excellence in production environments.

  • 8+ years of experience in DevOps, SRE, or platform engineering roles
  • Strong hands-on experience with Kubernetes and related ecosystem tools (Helm, Docker, ingress controllers, etc.)
  • Solid experience with CI/CD systems, preferably GitLab CI, including pipeline design and deployment strategies
  • Strong scripting ability in Bash or Python (Go is a plus) for automation and tooling
  • Practical experience with AWS services such as IAM, EC2/EKS, S3, CloudWatch, and Secrets Manager
  • Deep understanding of observability concepts including metrics, logs, tracing, and alerting systems
  • Experience with Prometheus, Alertmanager, Thanos, and OpenTelemetry
  • Comfortable working in ticket-driven environments (Jira, ServiceNow) and following change management processes
  • Strong communication skills and ability to collaborate with engineering and product teams
  • Bonus: Terraform experience for infrastructure as code and AWS/Kubernetes provisioning
  • Bonus: API integration experience (Python, Java, or Go) for internal tooling
  • Bonus: Strong Linux and container runtime debugging knowledge
  • Bonus: Exposure to regulated industries such as finance or insurance

Benefits:
  • Competitive compensation package aligned with experience
  • Fully remote role within the United States
  • Opportunity to work on large-scale, cloud-native infrastructure systems
  • High-impact role focused on reliability, automation, and platform engineering excellence
  • Exposure to modern DevOps tooling including Kubernetes, CI/CD, and observability stacks
  • Collaborative engineering culture focused on continuous improvement and innovation
  • Opportunity to work in fast-paced environments solving complex technical challenges

How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

Job Location

United States, United States
