Senior Network Engineer in United States at Jobgether
Job Description
This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Senior Network Engineer based in the United States.
This is an exciting opportunity for an experienced infrastructure professional to help build and scale the networking foundation powering next-generation AI and high-performance computing environments. In this role, you will design, deploy, and optimize large-scale data center networks that support demanding workloads requiring exceptional performance, reliability, and scalability. Working closely with platform, compute, storage, and operations teams, you will contribute to the development of advanced networking architectures and automation frameworks. The position offers significant ownership and autonomy, making it ideal for someone who enjoys solving complex technical challenges and driving infrastructure innovation. You will play a critical role in ensuring operational excellence across distributed environments while helping shape the future of large-scale AI infrastructure. This role combines deep networking expertise with a collaborative, fast-paced engineering culture focused on continuous improvement and technical excellence.
Responsibilities
- Design, implement, and maintain scalable spine-leaf network architectures for large-scale data center and AI infrastructure environments.
- Build and optimize high-performance Ethernet fabrics supporting GPU clusters, AI training workloads, and inference operations.
- Manage and enhance network technologies including BGP, EVPN, VXLAN, and advanced Layer 3 routing environments.
- Support and improve low-latency networking solutions, including RDMA, RoCE, and other high-performance transport technologies.
- Develop and maintain backbone, WAN, data center interconnect (DCI), and edge connectivity solutions.
- Collaborate closely with infrastructure, storage, platform, and operations teams to deliver integrated and resilient networking solutions.
- Design and implement Infrastructure-as-Code (IaC) and automation frameworks to streamline network provisioning, configuration management, and operational processes.
- Troubleshoot complex performance, congestion, reliability, and connectivity issues across distributed environments.
- Improve network observability through telemetry, monitoring, analytics, and operational visibility initiatives.
- Contribute to technical documentation, operational procedures, architecture standards, and best practices to support scalability and reliability.
- Participate in infrastructure planning and continuous improvement efforts to ensure network platforms can support future growth.
Requirements
- Minimum of 5 years of experience designing and operating large-scale data center networking environments.
- Hands-on expertise with Cumulus Linux (Cumulus NOS) in production environments.
- Strong experience with spine-leaf architectures, Layer 3 fabrics, and modern data center networking principles.
- Advanced knowledge of BGP, EVPN, VXLAN, and large-scale routing environments.
- Experience supporting high-performance computing (HPC), GPU-intensive, or AI-focused infrastructure environments.
- Proven ability to design scalable systems capable of supporting thousands of nodes and large distributed workloads.
- Experience with network automation using Python, Ansible, Terraform, or similar tools.
- Familiarity with network observability platforms, telemetry pipelines, monitoring systems, and performance analysis tools.
- Experience working with SONiC, Junos, cloud networking technologies, VPCs, Direct Connect, Cloud Connect, NFV, or related solutions.
- Strong troubleshooting and problem-solving skills within complex infrastructure environments.
- Excellent communication, collaboration, and documentation abilities.
- Experience with NVIDIA networking technologies, InfiniBand, RoCE, RDMA, multi-region backbone design, or bare-metal provisioning is highly desirable.
- Ability to work independently, prioritize effectively, and thrive in a fast-paced, high-growth environment.
- Must be authorized to work in the United States without current or future visa sponsorship requirements.
Benefits
- Competitive base salary ranging from $150,000 to $190,000 USD annually.
- Eligibility for discretionary performance bonuses.
- Meaningful equity participation opportunities.
- Comprehensive medical, dental, and vision insurance coverage.
- Retirement savings and financial wellness programs.
- Generous paid time off and company holidays.
- Paid parental leave.
- Professional development and learning support.
- Wellness stipends and work-from-home allowances.
- Flexible remote-first work environment.
- Fully remote position within the continental United States.
- Opportunities to collaborate with highly skilled engineers working on cutting-edge AI infrastructure.
- Occasional team and company offsite events.