SRE Team Lead (GameDev) in UK at Jobgether
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for an SRE Team Lead (GameDev) in the United Kingdom.
Join a high-impact engineering team responsible for ensuring the reliability, scalability, and performance of a fast-growing gaming platform serving international markets. This role offers the opportunity to lead Site Reliability and DevOps initiatives while shaping infrastructure strategy and engineering best practices. You will work closely with development teams to enhance platform stability, optimize deployment processes, and drive automation across complex cloud-native environments. Combining technical leadership with hands-on engineering, you'll mentor a talented team and influence architectural decisions that support long-term growth. If you're passionate about Kubernetes, observability, cloud infrastructure, and building resilient systems at scale, this is an opportunity to make a meaningful impact in a dynamic and innovation-driven environment.
Responsibilities
- Lead, mentor, and support a team of DevOps and SRE engineers, fostering technical growth and engineering excellence.
- Define and evolve Site Reliability Engineering and DevOps practices, processes, and standards across the organization.
- Contribute to technical strategy, infrastructure roadmaps, and the continuous improvement of engineering culture.
- Design, maintain, and enhance highly available, scalable, and resilient cloud-native infrastructure.
- Develop and expand monitoring, observability, alerting, and incident management capabilities to ensure system reliability.
- Participate in and coordinate on-call rotations while improving incident response and root cause analysis processes.
- Automate infrastructure provisioning, operational workflows, and repetitive tasks using Infrastructure as Code and scripting.
- Collaborate closely with development teams to improve system reliability, deployment pipelines, CI/CD processes, and overall platform performance.
- Promote Kubernetes-native approaches and provide technical mentorship on cloud and platform engineering practices.
- Support architectural decision-making and contribute to the evolution of cloud infrastructure and operational excellence.
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering, or related infrastructure-focused roles.
- Previous leadership experience or a strong desire and capability to take ownership of a technical team.
- Deep expertise in Kubernetes and container orchestration platforms, with at least 4 years of hands-on experience.
- Strong experience with Terraform and Infrastructure as Code practices.
- Proven experience working with Oracle Cloud Infrastructure and managing cloud-based environments.
- Solid background in building and maintaining highly available and fault-tolerant systems.
- Experience managing both SQL databases (particularly PostgreSQL) and NoSQL technologies.
- Strong knowledge of observability tooling, including Prometheus, Grafana, and exporters, as well as monitoring and alerting systems.
- Proficiency in automation and scripting using Python and Bash.
- Strong understanding of CI/CD pipelines, GitOps methodologies, and platform engineering concepts.
- Excellent troubleshooting skills with a structured approach to root cause analysis and reliability improvements.
- Proactive mindset focused on automation, operational efficiency, and continuous improvement.
- Strong communication and collaboration skills with experience working across engineering teams.
- Nice to have: Certified Kubernetes Administrator (CKA) certification.
- Nice to have: experience with AWS and Google Cloud Platform.
- Nice to have: ability to read and understand JavaScript, TypeScript, or Ruby codebases.
Benefits
- Fully remote work environment with flexibility to work from the location that suits you best.
- Competitive compensation package.
- Opportunity to lead and shape infrastructure strategy within a rapidly growing technology organization.
- Clear career development framework with performance reviews, mentoring programs, and advancement opportunities.
- Dedicated learning budget for professional courses, certifications, workshops, and training programs.
- Corporate English lessons and access to educational resources and online libraries.
- Private medical insurance and mental health support programs.
- Generous paid vacation, public holidays, and sick leave.
- Monthly flexible benefits budget that can be used for hobbies, sports, wellness, and personal interests.
- Regular team-building activities, workshops, and company events.
- Collaborative, low-bureaucracy culture that encourages autonomy, innovation, and ownership.
- Opportunity to work with modern technologies and contribute to large-scale cloud-native platforms.