Senior Forward Deployed Engineer (Remote) at Jobgether – United States
Jobgether
United States, United States
Posted on
NewJob Function:Information Technology
New job! Apply early to increase your chances of getting hired.
About This Position
Senior Forward Deployed Engineer (Remote)
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Forward Deployed Engineer. In this role, you will play a critical part in connecting advanced AI inference platforms with our customers' production environments. You will work directly with engineering teams to deploy, scale, and optimize large language model systems, solving unique infrastructure challenges that cannot be addressed with standard solutions. Your expertise will ensure high performance and reliability in the deployment of cutting-edge technologies, directly impacting customer success and satisfaction. This collaborative position requires not only technical skills but also the ability to understand and address customer needs effectively.Accountabilities
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Forward Deployed Engineer. In this role, you will play a critical part in connecting advanced AI inference platforms with our customers' production environments. You will work directly with engineering teams to deploy, scale, and optimize large language model systems, solving unique infrastructure challenges that cannot be addressed with standard solutions. Your expertise will ensure high performance and reliability in the deployment of cutting-edge technologies, directly impacting customer success and satisfaction. This collaborative position requires not only technical skills but also the ability to understand and address customer needs effectively.Accountabilities
- Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters.
- Optimize for Production: Develop deployment strategies that enhance performance and meet latency and throughput goals.
- Code Side-by-Side: Collaborate with customer engineers to write production-quality code in Python or Go.
- Solve the 'Unsolvable': Debug complex interactions in model architectures and Kubernetes networking.
- Feedback Loop: Serve as 'Customer Zero' to relay insights back to engineering teams.
- Travel only as needed for presentations, demos, or proof-of-concept executions.
- 8+ Years of Engineering Experience in Backend Systems, SRE, or Infrastructure Engineering.
- Customer Fluency: Ability to communicate technical concepts in business terms.
- Bias for Action: Preference for rapid prototyping and owning outcomes in ambiguous situations.
- Deep Kubernetes Expertise including custom resources and high-performance networking.
- AI Inference Proficiency: Understanding of LLM processes and related optimization techniques.
- Proficiency in Systems Programming with Python and Go.
- Experience with Infrastructure as Code tools such as Helm or Terraform.
- Fluent in deploying LLMs on cloud and bare-metal Kubernetes clusters.
- Comprehensive medical, dental, and vision coverage.
- Flexible Spending Account for healthcare and dependent care.
- Health Savings Account options available.
- Retirement 401(k) with employer match.
- Paid time off and holidays.
- Parental leave plans for all new parents.
- Additional benefits including employee stock purchase plan, tuition reimbursement, and more!
Scan to Apply
Just scan this QR code to apply from your phone.
Job Location
United States, United States