Platform Engineer (Database Reliability) in Canada Creek, Nova Scotia at Jobgether
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Platform Engineer (Database Reliability) in Canada.
This role sits at the core of production reliability, focusing on ensuring the stability, performance, and scalability of critical database systems that power high-traffic ecommerce platforms. You will be responsible for maintaining and optimizing production MySQL environments while contributing to broader platform engineering initiatives across cloud infrastructure, automation, and observability. Working in a fast-paced, cloud-native environment, you will collaborate closely with engineering and platform teams to strengthen system resilience and operational maturity. This position blends hands-on database expertise with modern DevOps and SRE practices, offering the opportunity to directly impact system uptime and performance. You will also play a key role in incident response, root cause analysis, and continuous improvement of production systems. This is an ideal opportunity for an engineer passionate about reliability, automation, and scalable infrastructure.
Responsibilities
- Ensure the reliability, scalability, and operational excellence of production MySQL database environments, including replication, failover, backup, recovery, upgrades, and performance tuning.
- Support and enhance cloud infrastructure and platform services, contributing to automation, infrastructure-as-code, and operational tooling using technologies such as Terraform.
- Improve system observability by strengthening monitoring, alerting, and logging across databases, infrastructure, and application layers.
- Participate in incident response, troubleshooting, root cause analysis, and remediation of production issues across database and infrastructure systems.
- Collaborate with engineering and platform teams to improve system reliability, scalability, and deployment readiness in production environments.
- Contribute to CI/CD pipelines, automation initiatives, and operational best practices to increase efficiency and reduce manual interventions.
- Participate in an on-call rotation to support high availability and continuity of critical systems.
Requirements
- 5+ years of experience in Platform Engineering, Site Reliability Engineering, DevOps, Database Reliability, or Infrastructure Engineering roles.
- Strong hands-on experience managing production MySQL environments, including replication, failover, backups, recovery, and performance optimization.
- Experience working with cloud infrastructure (GCP or similar environments) supporting scalable, highly available systems.
- Proficiency with infrastructure-as-code and automation tools such as Terraform, along with exposure to Kubernetes, Docker, and CI/CD pipelines.
- Strong Linux systems administration skills with experience in production troubleshooting and incident response.
- Experience with monitoring, observability, and alerting tools to ensure system reliability and performance.
- Scripting and automation ability using languages such as Python, Go, Bash, or JavaScript.
- Strong analytical, communication, and problem-solving skills with the ability to operate independently in fast-paced environments.
Benefits
- Competitive salary aligned with experience and technical expertise.
- Employer-paid health and dental coverage starting from day one.
- Annual health spending allowance to support personal wellness.
- Access to virtual mental health support and employee assistance programs.
- Equity options enabling long-term participation in company growth.
- Remote-first environment with flexibility to work from anywhere in Canada.
- Flexible working hours to support work-life balance.
- Generous paid vacation policy and time-off benefits.