Forward Deployed Machine Learning Engineer at OmegaHires – San Francisco, California
OmegaHires
San Francisco, California, 94101, United States
Job Function: Information Technology
About This Position
Title: Forward Deployed Machine Learning Engineer
Job Type: Contract / FTE (W2 only)
Location: San Francisco, CA - Onsite / Hybrid / Remote
Role Overview
We’re seeking a high-agency Forward Deployed / Applied ML Engineer to bridge cutting-edge generative AI research with real-world production systems. You’ll work directly with customers to deploy, optimize, and customize our FLUX diffusion models across diverse environments, from on-prem GPU clusters to hosted infrastructure.
Key Responsibilities
Deploy and optimize FLUX diffusion models in customer environments, balancing latency, cost, and output quality
Architect deep product integrations beyond APIs, including model hosting, inference optimization, and production deployment
Fine-tune and customize foundation models for customer-specific visual media use cases
Lead technical deep dives with customers to diagnose model, infrastructure, and performance issues
Translate customer challenges into actionable engineering solutions and research feedback
Identify emerging industry use cases for generative visual AI
Required Qualifications
Hands-on experience deploying and serving generative AI / deep learning models in production
Strong expertise in diffusion models, model fine-tuning, optimization, and inference
Proven experience as an ML Engineer shipping models used by real systems
Strong Python skills and experience designing and consuming APIs
Ability to communicate complex ML tradeoffs to both technical and non-technical stakeholders
Experience working directly with customers on technical AI integrations
Intimate knowledge of the FLUX ecosystem: ComfyUI, common training frameworks, and the tools practitioners actually use
Preferred Qualifications
Deep knowledge of diffusion models, flow matching, distillation, and advanced fine-tuning techniques
Experience optimizing inference for transformer-based models under real production constraints
Experience deploying models on cloud platforms with modern serving infrastructure
Background contributing to open-source ML / diffusion model projects
Experience designing solutions in constrained enterprise environments