Senior Site Reliability Engineer (DevTools) in Romania at Jobgether
Explore Related Opportunities
Job Description
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Site Reliability Engineer (DevTools) based in Romania.
This is an exciting opportunity to join a highly technical engineering environment focused on building and operating large-scale developer infrastructure. In this role, you will help maintain, optimize, and evolve critical development platforms that support thousands of daily builds, large-scale source code repositories, and extensive artifact storage systems. Working at the intersection of software engineering and site reliability, you will contribute to resilient, self-healing architectures while improving developer productivity and user experience. You will collaborate with talented engineers, solve complex infrastructure challenges, and leverage modern technologies, including AI-powered tools and automation. The position offers significant ownership, technical depth, and the chance to influence the future of engineering productivity at scale.
- Design, operate, and continuously improve large-scale developer infrastructure and internal tooling platforms.
- Build and maintain reliable, fault-tolerant, and self-healing systems that ensure high availability and performance.
- Analyze user feedback, identify pain points, and implement solutions that enhance developer experience and productivity.
- Optimize system performance, reduce operational friction, and improve the efficiency of development workflows.
- Develop, customize, and extend both open-source and commercial tools to better meet organizational needs.
- Contribute to software development initiatives across multiple programming languages and technology stacks.
- Monitor platform health, troubleshoot incidents, and implement preventive measures to improve reliability.
- Collaborate with engineering teams to define meaningful operational metrics and validate improvements through measurable outcomes.
- Support users by resolving technical issues, providing guidance, and ensuring platform stability.
- Explore and integrate emerging technologies, including AI-assisted workflows and developer productivity solutions.
- Proven experience combining Site Reliability Engineering and Software Engineering responsibilities in production environments.
- Strong programming skills and hands-on development experience with languages such as Java, Kotlin, Go, Python, Ruby, or similar.
- Solid understanding of Unix/Linux operating systems, system internals, and infrastructure troubleshooting.
- Strong knowledge of JVM-based applications, performance optimization, and operational best practices.
- Experience designing, operating, and improving highly available and scalable systems.
- Passion for enhancing user experience through engineering excellence and continuous improvement.
- Ability to adapt quickly, solve complex technical problems, and perform effectively in fast-changing environments.
- Strong analytical thinking, troubleshooting capabilities, and attention to detail.
- Excellent communication and collaboration skills within cross-functional engineering teams.
- Experience in Platform Engineering, developer platforms, or internal tooling environments is highly valued.
- Familiarity with version control systems, CI/CD platforms, and build infrastructure such as GitLab, TeamCity, or equivalent solutions is advantageous.
- Experience with Spring Framework, Java-based monolithic applications, or large-scale enterprise systems is considered a plus.
- Comfortable participating in technical assessments and coding interviews as part of the hiring process.
- Competitive compensation package.
- Career development, continuous learning, and professional growth opportunities.
- Flexible working arrangements that support work-life balance.
- Opportunity to work on innovative and impactful AI-driven technologies and infrastructure.
- Collaborative, inclusive, and engineering-focused culture.
- Exposure to complex technical challenges at significant scale.
- International work environment with highly skilled and diverse teams.
- High levels of ownership, autonomy, and influence over technical decisions.
- Opportunity to contribute to the future of developer platforms and cloud technologies.
- Dynamic environment that encourages innovation, bold thinking, and continuous improvement.
- Equal opportunity workplace committed to diversity, inclusion, and fair employment practices.