Staff AI Software Engineer, Edge Model Optimization & Deployment at Field AI – Seattle, Washington
Field AI
Seattle, Washington, 98101, United States
Posted on
NewJob Function:Information Technology
New job! Apply early to increase your chances of getting hired.
About This Position
Staff AI Software Engineer, Edge Model Optimization & Deployment
FieldAI is transforming how robots interact with the real world. Our growing ML team in Seattle builds risk-aware, reliable, field-ready AI systems that tackle the hardest problems in robotics and unlock the potential of embodied intelligence. We take a pragmatic approach that goes beyond off-the-shelf, purely data-driven methods or transformer-only architectures, combining cutting-edge research with real-world deployment. Our solutions are already deployed globally, and we continuously improve model performance through rapid iteration driven by real field use.
We are seeking an accomplished Staff AI Software Engineer - Edge Model Optimization & Deployment to drive the optimization, integration, and deployment of our ML models on real robotic platforms. In this role, you will own the edge inference stack end to end, profiling and accelerating models, improving runtime performance across latency, throughput, memory, and power, and partnering closely with perception, autonomy, and platform teams to deliver robust on-robot behavior in the field. You will set technical direction, raise engineering rigor, and ensure our models run efficiently and reliably on constrained hardware across diverse environments.
This is an opportunity to shape the future of robotic autonomy by translating state-of-the-art ML into high-performance, production-grade edge deployments that operate reliably in complex, dynamic environments on real robots.
What Youll Do:- Convert and optimize 2D/3D CNNs and Transformer-based models (PyTorch/TensorFlow ONNX TensorRT/Triton) for real-time inference on Jetson/Orin platforms.
- Apply model compression techniquesquantization, pruning, distillation, weight sharingto meet strict constraints on latency, memory, bandwidth, and power.
- Develop custom TensorRT plugins and CUDA kernels for performance-critical components.
- Integrate optimized models into the broader robotic system using ROS nodes and interfaces.
- Build benchmarks, profile and debug end-to-end inference pipelines, and validate performance in real-world robotic scenarios.
- Collaborate closely with AI researchers, robotics engineers, and hardware teams to translate cutting-edge research into robust, deployable edge solutions.
- Ensure the reliability, robustness, and stability of deployed models operating continuously in challenging, resource-constrained environments.
- 5+ years of professional experience developing and deploying deep learning models for edge, embedded, or real-time systems.
- PhD in Computer Science, Robotics, Electrical or Computer Engineering, or a closely related technical field.
- Strong proficiency in PyTorch, C++, Python, and CUDA for AI/ML development and model optimization.
- Hands-on experience with TensorRT, ONNX, and Triton, including authoring custom plugins for TensorRT.
- Proven experience applying model optimization techniques such as quantization, pruning, and distillation in production systems.
- Deep understanding of hardware constraints and performance tuning on Jetson / ARM platforms, GPUs, and embedded Linux systems.
- Experience integrating AI models into ROS-based robotic systems.
- Ability to work independently while collaborating effectively in a fast-paced, cross-functional engineering environment.
- Experience with ROS2.
- Experience writing and optimizing custom CUDA kernels and low-level GPU performance tuning.
- Familiarity with Triton, ML compilers, or compiler-level optimizations for GPU inference.
- Experience with JAX or additional ML frameworks beyond PyTorch.
- Background deploying AI systems on real robots operating in the field, not just offline or in simulation.
- Familiarity with NVIDIAs edge and robotics ecosystem (e.g., Isaac ROS, DeepStream, JetPack).
Scan to Apply
Just scan this QR code to apply from your phone.
Job Location
Seattle, Washington, 98101, United States
Frequently asked questions about this position
Latest Job Openings in Washington
CNC Lathe Machinist (I-IV)
RTC Aerospace
Fife, WA
Tax Preparer
Priority Tax Relief
Spokane, WA
Licensed Veterinary Technician, AESC
Ethos Veterinary Health
Poulsbo, WA
CDL-A - New pay increase - Team Van Truckload truck driver
Schneider
Olympia, WA
Financial Aid Program Specialist II
Bellingham Technical College
Bellingham, WA
Continue to apply
Enter your email to continue. You’ll be redirected to the employer’s application.By clicking Continue, you understand and agree to JobTarget's Terms of Service and Privacy Policy.
Apply Now