System Infrastructure / Platform Engineer in Berkeley, California at Bay Systems Consulting Inc.
Explore Related Opportunities
Job Description
1 year contract - Extension/Conversion based on job performance
Hybrid/Remote
Pay Rate: $90 HR plus benefits
Software / API Engineer to join our team building software systems that integrate scientific workflows and supercomputing systems.
More than ever, scientific discovery transforms our world. NERSC is at the forefront, operating some of the world’s largest supercomputers for thousands of researchers who use computational power to solve society’s most challenging problems.
In this exciting role, you will work on API development and other software engineering projects to help automate the use of supercomputing resources and introduce cloud-native and AI tools and techniques for researchers to use at massive scale. You’ll join a group of systems and software engineers and will routinely work with other groups across NERSC on a variety of projects. You’ll also collaborate with our counterparts at peer scientific facilities, also operated by the Department of Energy Office of Science, on a national program to pool together vast computational and storage resources through the development of APIs, distributed services, and community standards and best practices.
What You Will Do, at Level 3:Work with a team to develop and maintain a broad portfolio of software projects.Build, refine and support API endpoints and integration to backend systems to enable automation for complex workflows.Troubleshoot and solve complex technical problems with other team membersDevelop and refactor scripts and other code.Coordinate small project teams or other initiatives (such as the rollout of a new service or system, or a major equipment or software upgrade).Work with vendors to prioritize efforts and enhance their technologies to meet user needsWork with researchers to deploy services using Spin, our container cloud platform based on Kubernetes.Collaborate within NERSC and across the DOE community to develop APIs and services, integrate them into the new NERSC supercomputer Doudna, the NERSC data center environment, and across multiple DOE facilities.Present developments to NERSC staff and the broader HPC community at science conferences and industry meetings.Additional Responsibilities, at Level 4:Analyze and solve complex technical problems requiring in-depth evaluation of variable factorsWork at a higher level of independence while carrying out work assignments.Research, select, and lead the implementation of new technologies.Develop team strategy and project plans.Provide leadership and technical guidance to group members and other colleagues at NERSC.Recommend and lead system improvement efforts that enhance system performance, reliability, and security.Identify and evaluate emerging HPC technologies and features that could introduce novel capabilities or enhance existing system performance and utility.Represent NERSC in technical or user advocacy groups to influence the HPC and DOE community to meet user needs.What is Required, at Level 3:- Typically, 8+ years of related experience with a Bachelor’s degree; alternatively, 6+ years with a Master’s degree; or equivalent career experience.2+ years of experience with API and web services software development on Linux systems in a high-performance computing, cloud computing, or hyper-scale environment.Familiarity with designing and building API interfaces to compute, storage, or other backend systems.Experience with some or all of our key technologies:C, shell, and Python programming languagesGit, runners, and complex CI/CD pipelinesUsing and developing AI (or machine learning) tools and servicesDatabase administration and optimization (such as MongoDB, MySQL or PostgreSQL)Container technology (such as Docker or Kubernetes)
- Typically, 12+ years of related experience with a Bachelor’s degree; alternatively, 8+ years with a Master’s degree; or equivalent career experience.Experience with OpenAPI and other API frameworks.Experience managing Linux systems.Experience deploying and managing virtualization and/or container technologiesAbility to lead and coordinate projects.Ability to analyze and resolve significant and unique issues requiring evaluation of multiple intangible factors.Ability to exercise independent judgment in methods, techniques and evaluation criteria for obtaining results.