Especialista SRE | Platform Engineering in Brazil, Indiana at Jobgether
Job Description
This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for an Especialista SRE | Platform Engineering based in Brazil.
This role plays a strategic part in evolving and operating large-scale cloud and Kubernetes platforms that support mission-critical corporate systems. You will work at the center of platform engineering, enabling development teams through reliable infrastructure, automation, and modern SRE practices. The environment is highly collaborative and focused on scalability, resilience, and continuous improvement across complex distributed systems.
You will contribute to the evolution of Kubernetes platforms across Azure and AWS, ensuring high availability, security, and operational excellence.
The role involves building advanced automation, CI/CD pipelines, and GitOps-based workflows to accelerate software delivery.
You will also be responsible for observability, incident response, and reliability engineering in mission-critical environments.
This is a high-impact opportunity to shape internal developer platforms and drive cloud transformation at scale.
Responsibilities:
- Evolve and maintain enterprise Kubernetes platforms (AKS/EKS), ensuring scalability, resilience, security, and high availability.
- Design and implement Infrastructure as Code and GitOps-based automation to improve infrastructure provisioning and operations.
- Build, maintain, and optimize CI/CD pipelines using modern tools such as GitHub Actions and ArgoCD.
- Implement and enhance observability solutions including monitoring, logging, and distributed tracing for system reliability.
- Lead incident response activities, perform root cause analysis, and drive long-term reliability improvements.
- Support development teams in adopting best practices in cloud, Kubernetes, automation, and platform engineering.
- Improve internal developer platforms to enhance developer experience and accelerate delivery.
- Optimize cloud environments through autoscaling, capacity management, performance tuning, and cost efficiency.
- Collaborate on architecture, security, and governance standards across Azure and AWS environments.
- Evaluate and implement new technologies related to SRE, platform engineering, and automation.
Requirements:
- Completed higher education degree.
- Strong experience managing and evolving Kubernetes environments, especially AKS and/or EKS.
- Solid experience with public cloud platforms such as Azure and/or AWS.
- Experience building and maintaining CI/CD pipelines and GitOps workflows (GitHub Actions or similar tools).
- Advanced knowledge of Infrastructure as Code tools such as Terraform, Crossplane, or equivalent.
- Strong experience with observability stacks (Grafana, Prometheus, OpenTelemetry, Loki, Tempo or similar).
- Solid Linux, container, and Docker expertise, including troubleshooting and optimization.
- Experience with scripting and automation using Bash, Python, PowerShell, or similar languages.
- Strong understanding of networking, DNS, load balancing, and cloud security concepts.
- Proven ability to operate and improve large-scale, mission-critical distributed systems.
- Nice to have: experience with ArgoCD, Argo Workflows, Karpenter, service mesh (Istio/Linkerd), FinOps, AIOps, and multi-cloud or hybrid architectures.
Benefits:
- Opportunity to work on large-scale cloud and Kubernetes platform engineering initiatives
- Exposure to modern SRE, GitOps, and DevOps best practices
- Work with Azure and AWS environments in a complex enterprise ecosystem
- Strong focus on automation, reliability, and continuous improvement
- Collaborative engineering culture focused on learning and technical excellence
- Opportunity to shape internal developer platforms and engineering standards
- Participation in a major digital transformation and cloud modernization journey
- Competitive benefits package (details provided during hiring process)