HPC Engineer in United States at Jobgether
Explore Related Opportunities
Job Description
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a HPC Engineer based in the United States.
This role sits at the core of scientific and computational innovation, supporting high-performance computing environments that enable life sciences and research breakthroughs. You will design, build, and operate scalable HPC and cloud-based infrastructure that powers complex scientific workloads and data-intensive pipelines. The position blends systems engineering, cloud architecture, and research computing support, requiring close collaboration with scientists, IT teams, and research stakeholders. You will be responsible for ensuring performance, reliability, and scalability across both on-prem and cloud environments. The environment is highly technical and mission-driven, focused on enabling cutting-edge scientific discovery through robust compute platforms. This is a hands-on engineering role where your work directly accelerates research and data-driven innovation.
- Design, deploy, and maintain high-performance computing (HPC) clusters and cloud-based compute environments.
- Support scientific workflows, research pipelines, and compute-intensive applications across life sciences and analytics domains.
- Administer and optimize HPC scheduling systems such as SLURM or Grid Engine.
- Architect and manage cloud infrastructure on AWS and GCP, including migrations and modernization initiatives.
- Implement Infrastructure-as-Code and automation using tools such as Ansible, Terraform, or CloudFormation.
- Perform performance tuning, workload optimization, and system troubleshooting across compute, storage, and network layers.
- Support Linux system administration tasks including installation, configuration, and package management.
- Maintain and enhance integrations with scientific and research applications and platforms (e.g., POSIT tools).
- Ensure system security, compliance, and operational best practices across environments.
- Provide incident response, root-cause analysis, and production support for HPC platforms.
- Document architectures, workflows, and operational procedures for ongoing system maintainability.
- Mentor junior engineers and provide technical guidance across projects and engagements.
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
- 5+ years of experience administering HPC clusters and scientific computing environments.
- Strong experience with Linux system administration and command-line tooling.
- Hands-on experience with HPC schedulers such as SLURM or Grid Engine.
- 5+ years of experience in cloud infrastructure or solution architecture (AWS and/or GCP).
- Experience supporting scientific or research computing workloads, preferably in life sciences.
- Strong understanding of Linux networking, storage systems (NFS/SMB), and directory services (LDAP/AD/DNS).
- Experience with Infrastructure-as-Code tools such as Terraform, Ansible, or CloudFormation.
- Ability to install, configure, and troubleshoot scientific and HPC applications.
- Strong scripting skills (Python, Bash, or similar).
- Excellent communication, documentation, and stakeholder collaboration skills.
- Strong problem-solving mindset with ability to manage complex distributed systems.
- Competitive annual salary based on experience.
- Comprehensive medical, dental, and vision insurance.
- 401(k) retirement plan with company contribution.
- Life and long-term disability insurance provided.
- Paid continuing education and professional development support.
- Fully remote role aligned to a US East Coast schedule.
- Opportunity to work on cutting-edge scientific computing and research initiatives.
- Strong team-oriented culture with long-term career growth potential.