Software Engineer: MLOps at Field AI – Irvine, California
Explore Related Opportunities
About This Position
Field AI is transforming how robots interact with the real world. We are building risk-aware, reliable, and field-ready AI systems that address the most complex challenges in robotics, unlocking the full potential of embodied intelligence. We go beyond typical data-driven approaches or pure transformer-based architectures, and are charting a new course, with already-globally-deployed solutions delivering real-world results and rapidly improving models through real-field applications.
We are seeking a skilled and motivated MLOps Engineer to join our engineering team. In this role, you will design and maintain the infrastructure and tooling that supports the full lifecycle of machine learning systems used in robotics applications. You will work closely with machine learning engineers, robotics engineers, and infrastructure teams to ensure reliable training, evaluation, deployment, and monitoring of ML models. This is an exciting opportunity to help operationalize machine learning in real-world robotic systems within a fast-growing and dynamic environment.
Design, build, and maintain GPU based infrastructure for machine learning pipelines, including data processing, training, evaluation, inference and deployment workflows.
Collaborate closely with robotics teams to implement model serving infrastructure for edge/robot deployment.
Build tools and automation to support reproducible experiments, model versioning, and dataset management.
Deploy and manage ML services and inference pipelines using containerized environments for efficient scaling and scheduling of heterogeneous compute resources.
Monitor model performance and system reliability across development and production environments.
Improve the efficiency, scalability, and reliability of ML workflows and infrastructure.
Work with cross-functional engineering teams to integrate ML components into robotics software systems.
Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent work experience).
3-7 years of experience in MLOps, machine learning infrastructure, or related engineering roles.
Strong programming skills in Python or similar languages.
Experience building and maintaining machine learning pipelines.
Hands-on experience with cloud and cloud-native tools such as AWS (SageMaker, S3, or similar cloud ML services), Kubernetes etc.,
Solid understanding of Linux systems and distributed computing environments.
Experience with GPU workload scheduling and orchestration across multi-region cloud environments.
Excellent problem-solving skills and the ability to work collaboratively in a team environment.
Experience deploying and operating ML systems for robotics or real-world physical systems.
Experience with scaling AI, ML, and inference workloads on Kubernetes.
Exposure to ROS-based robotics data formats and pipelines (rosbags, point clouds)
Experience with experiment tracking, model versioning, or dataset versioning tools.
Experience optimizing ML pipelines for large-scale training and data processing.
Experience working closely with research or applied machine learning teams.