What is the role of a Senior Site Reliability Engineer at Jobgether?

The Senior Site Reliability Engineer position at Jobgether is a Full-time or part-time position opportunity in the relevant field.

Where is this Senior Site Reliability Engineer job located?

United States, Other / Non-US, United States

What type of employment is offered for this Senior Site Reliability Engineer role?

Full-time or part-time position

What industry does this Senior Site Reliability Engineer position belong to?

This role spans multiple industries.

What is the expected salary for this Senior Site Reliability Engineer job?

Compensation will be discussed during the hiring process.

How can I apply for the Senior Site Reliability Engineer position at Jobgether?

You can apply directly through the application link provided.

Senior Site Reliability Engineer at Jobgether

Senior Site Reliability Engineer

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Site Reliability Engineer in the United States.

This role offers a unique opportunity to ensure the reliability, scalability, and performance of critical platform services in a fast-paced, technology-driven environment. The Senior Site Reliability Engineer (SRE) will combine software engineering expertise with operational excellence to automate processes, improve observability, and reduce operational risk across the platform. You will collaborate closely with development, DevOps, release engineering, and security teams to embed reliability and security best practices throughout the software lifecycle. This position emphasizes proactive problem-solving, automation, and continuous improvement while providing mentorship to peers and contributing to high-impact projects. The role is ideal for someone who thrives on solving complex technical challenges while shaping the platform’s resilience and scalability.

Accountabilities:

As a Senior Site Reliability Engineer, you will be responsible for maintaining and improving platform reliability while enabling scalable operations:

Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets for critical services.
Lead capacity planning, performance tuning, design reviews, and disaster recovery exercises to validate platform resilience.
Automate infrastructure provisioning, patching, and operational tasks using Terraform, Ansible, and CI/CD pipelines to eliminate manual processes.
Partner with security teams to enforce compliance (SOC2, CIS benchmarks), implement least-privileged IAM policies, and maintain hardened, secure systems.
Serve as Tier-2 escalation during incidents, lead root cause analysis, and continuously improve incident response playbooks and on-call processes.
Identify repetitive operational tasks and implement automation or self-service modules to reduce toil and improve developer productivity.
Measure system performance, track reliability metrics, and collaborate with leadership to drive iterative improvements.

Requirements:

The ideal candidate combines hands-on technical expertise with strong problem-solving skills and a focus on automation and reliability:

Bachelor’s degree in Computer Science, Engineering, or related field.
Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or Systems Engineering roles.
Strong experience with AWS multi-account environments, Terraform, Ansible, CI/CD tools (GitHub Actions, Bitbucket, Jenkins, AWS CodeBuild/CodePipeline), and observability platforms (New Relic, CloudWatch).
Background with containerized environments (ECS, Fargate, EKS) and resilient system architectures.
Preferred certifications: AWS DevOps Engineer or Solutions Architect, Kubernetes, or SRE/DevOps practitioner certifications.
Excellent analytical, troubleshooting, and problem-solving abilities.
Strong collaboration skills to work effectively with cross-functional teams, mentor peers, and contribute to continuous improvement.

Benefits:

This role provides a comprehensive benefits package designed to support health, growth, and work-life balance:

Competitive salary range: USD $120,000 – $125,000 per year.
Day-one medical, dental, vision coverage with flexible spending options (HSA/FSA).
401(k) with company match available from day one.
Paid sick leave, volunteer time, and parental leave options.
Employer-paid life and disability insurance.
Wellbeing on Demand program to support personal health and wellness.
Flexible work environment with remote opportunities and casual dress code.

Why Apply Through Jobgether?

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

#LI-CL1

Senior Site Reliability Engineer at Jobgether – United States

Explore Related Opportunities

About This Position

Scan to Apply

Job Location

Frequently asked questions about this position