Software and Web Developers, Programmers, and Testers Jobs Near Me in California
Showing 696 Software and Web Developers, Programmers, and Testers jobs available near me in California.
Member of Technical Staff, Forward Deployed AI Engineer
Inception
San Mateo, California
Lead Backend Engineer, Content Management (PHP/Laravel)
EMPIRE
San Francisco, California
iOS Developer Intern
Reply
Mi Wuk Village, California
Android Developer Intern
Reply
Mi Wuk Village, California
Software Engineer - Backend (Varying Levels)
Windfall
San Francisco, California
Junior Developer
JLab Audio
Carlsbad, California
Applied AI Developer (Remote)
CrowdStrike, Inc.
Sunnyvale, California
Senior QA Automation Engineer
Lytx, Inc.
San Diego, California
Test and Evaluation (T&E) Team Lead - SIGINT - San Diego
Epsilon Systems Solutions, Inc
San Diego, California
Senior Software Engineer, Frontend
Ascent
San Diego, California
10469 – Sr. QA Analyst
Hyundai Autoever America
Costa Mesa, California
10850 – Sr. Software Engineer, Applied AI
Hyundai Autoever America
Irvine, California
Federal Salesforce Developer
Thunder
San Francisco, California
Federal Salesforce Developer (Mobile)
Thunder
San Francisco, California
Low-Code/Application Developer
Sellers & Associates
Washington, California
Sr. Product Manager - 3D Web Development
ESRI, Inc
Redlands, California
Fullstack PHP developer (Magento 2 + WordPress) - Remote. Latin America
Bluelight Consulting
San Jose, California
Senior Software Engineer, Mapping Platform
Field AI
Irvine, California
Showing 18 of 696 results
Member of Technical Staff, Forward Deployed AI Engineer in San Mateo, California at Inception
Recently UpdatedJob Function: Information Technology
Inception
San Mateo, California, 94401, United States
Posted on
Explore Related Opportunities
Software and Web Developers, Programmers, and Testers jobs near me in CaliforniaJobs near me in CaliforniaSoftware and Web Developers, Programmers, and Testers jobs
Job Description
Inception creates the world’s fastest, most efficient AI models. Our Mercury model is the world’s fastest reasoning LLM and first commercially available diffusion LLM, delivering 5x greater speed and efficiency than today’s LLMs, with best-in-class quality.
We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.
The RoleInception is hiring Forward Deployed AI Engineers to help enterprise customers deliver the highest quality AI experiences using our diffusion-based language models.
This role sits at the intersection of product engineering, customer implementation, evals, data collection, model optimization, and enterprise deployment ownership. You will work directly with enterprise customers to identify high-value AI workflows, collect and structure customer data, build LLM-as-judge evaluation systems, tune model and product behavior for customer-specific goals, and turn fast proof-of-concepts into production deployments.
This is not a traditional solutions engineering role, a pure research role, or a long-cycle consulting implementation role. We are looking for full-stack engineers who can operate close to customers, build real systems, communicate clearly, and move fast — including running fast POC cycles that take weeks to produce customer impact rather than exploratory research projects that take months.
As an early member of the team responsible for turning Mercury models into high-value enterprise deployments and building the customer data flywheel that improves our models, products, and go-to-market motion. You will work closely with platform, serving, post-training, product engineering, and GTM teams to translate customer deployment learnings into model, product, and infrastructure improvements.
Key Responsibilities
Qualifications
Preferred Skills
A Note on the RoleThis role is for builders who want to be close to customers and close to the product.
We are not looking for traditional solutions engineers who only configure demos, nor researchers who primarily want to work on open-ended model experiments. The strongest candidates are full-stack engineers with enough ML fluency to work across LLM systems, evals, data, tuning, deployment, and production application development — and enough customer instinct to discover what matters, build quickly, and drive real adoption.
This role is also not just about serving one-off customer requests. The best FDEs will identify repeatable patterns across deployments and turn those learnings into better product surfaces, platform capabilities, evals, playbooks, and model feedback loops.
A Note on Startup FitThis is an in-office role at an early-stage company moving with high velocity. We're looking for engineers who are actively seeking a startup environment — comfortable with ambiguity, customer-facing work, rapid iteration, and end-to-end ownership.
The team is small and high-leverage. You should be excited to work directly with enterprise customers, own ambiguous problems, and build the systems that convert customer demand into production AI deployments.
Why Join Inception
Perks & Benefits
About UsInception creates the world’s fastest, most efficient AI models. Today’s autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception’s diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality.
Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion.
We pioneered the application of diffusion to language, with world’s first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today’s image and video AI, and we’re making it the standard for LLMs as well.
Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft’s venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.
If you are talented, innovative, and ambitious, come help us invent the future of AI.We are an equal opportunity employer and encourage candidates of all backgrounds to apply.
We are the AI researchers and engineers behind such breakthrough AI technologies as diffusion models, flash attention, and DPO.
The RoleInception is hiring Forward Deployed AI Engineers to help enterprise customers deliver the highest quality AI experiences using our diffusion-based language models.
This role sits at the intersection of product engineering, customer implementation, evals, data collection, model optimization, and enterprise deployment ownership. You will work directly with enterprise customers to identify high-value AI workflows, collect and structure customer data, build LLM-as-judge evaluation systems, tune model and product behavior for customer-specific goals, and turn fast proof-of-concepts into production deployments.
This is not a traditional solutions engineering role, a pure research role, or a long-cycle consulting implementation role. We are looking for full-stack engineers who can operate close to customers, build real systems, communicate clearly, and move fast — including running fast POC cycles that take weeks to produce customer impact rather than exploratory research projects that take months.
As an early member of the team responsible for turning Mercury models into high-value enterprise deployments and building the customer data flywheel that improves our models, products, and go-to-market motion. You will work closely with platform, serving, post-training, product engineering, and GTM teams to translate customer deployment learnings into model, product, and infrastructure improvements.
Key Responsibilities
- Enterprise customer deployments: Work directly with strategic enterprise customers to identify high-value AI workflows and turn them into production deployments.
- Rapid prototyping: Build and run fast proof-of-concepts, iterating on customer requirements and technical constraints on 2-week cycles.
- Production AI applications: Build full-stack AI applications, agentic workflows, integrations, internal tools, and customer-facing systems that bring Inception models into real enterprise environments.
- Data collection & feedback loops: Collect, structure, and operationalize customer data to improve model and product performance on customer use cases.
- Measurement and Evaluation: Define success metrics for customer deployments and design LLM-as-judge workflows, evaluation harnesses, and feedback loops for customer-specific use cases.
- Model and product optimization: Tune and customize Mercury models, prompts, workflows, and system architecture to meet customer-specific performance goals.
- Agentic workflows: Build and optimize agentic workflows including subagents involving classification, routing, context compaction, search, coding agents, voice, and other latency-sensitive applications.
- Build, prove, and generalize: Turn customer-specific deployments into repeatable product patterns, eval frameworks, implementation playbooks, and platform capabilities that improve Inception’s core product.
Qualifications
- BS/MS/PhD in Computer Science, Machine Learning, or a related field (or equivalent experience).
- Strong engineering skills in Python and modern full-stack development, including APIs, backend systems, and ideally TypeScript/JavaScript.
- Experience building, deploying, or integrating AI/LLM products with real users or customers.
- Familiarity with LLM evaluation, LLM-as-judge workflows, data pipelines, model tuning, prompt optimization, or agentic workflows.
- Customer-facing experience with enterprise, strategic, or high-value accounts.
- Experience deploying software or AI systems in enterprise environments with security, privacy, reliability, compliance, or integration constraints.
- Strong communication and discovery skills, with the ability to translate ambiguous customer needs into concrete technical solutions.
- Ability to operate across engineering, product, sales, and customer success without requiring heavy process or handholding.
- Willingness to work directly with customers in person when needed, including occasional travel for strategic deployments, workshops, and executive technical sessions.
Preferred Skills
- Experience with RAG, search, voice AI, coding agents, or agentic workflow systems.
- Experience deploying AI systems for Fortune 500 or large enterprise customers.
- Track record owning technical pre-sales, post-sales, implementation, or customer expansion for million-dollar enterprise accounts.
- Familiarity with LLM serving, latency optimization, model evaluation, or production ML systems.
- Experience with data engineering, synthetic data generation, or feedback loops for model improvement.
- Background in product engineering, ML product engineering, applied AI, or forward deployed engineering.
- Experience working with customer-specific evals, benchmarks, and performance targets.
- Familiarity with latency-sensitive applications, especially voice systems where response speed is critical.
A Note on the RoleThis role is for builders who want to be close to customers and close to the product.
We are not looking for traditional solutions engineers who only configure demos, nor researchers who primarily want to work on open-ended model experiments. The strongest candidates are full-stack engineers with enough ML fluency to work across LLM systems, evals, data, tuning, deployment, and production application development — and enough customer instinct to discover what matters, build quickly, and drive real adoption.
This role is also not just about serving one-off customer requests. The best FDEs will identify repeatable patterns across deployments and turn those learnings into better product surfaces, platform capabilities, evals, playbooks, and model feedback loops.
A Note on Startup FitThis is an in-office role at an early-stage company moving with high velocity. We're looking for engineers who are actively seeking a startup environment — comfortable with ambiguity, customer-facing work, rapid iteration, and end-to-end ownership.
The team is small and high-leverage. You should be excited to work directly with enterprise customers, own ambiguous problems, and build the systems that convert customer demand into production AI deployments.
Why Join Inception
- Work with World-Class Talent: Collaborate with the inventors of diffusion models and leading AI researchers
- Shape Foundational Technology: Your decisions will influence how the next generation of AI products are built and used
- Immediate Impact: Join at the ground floor where your contributions directly shape product direction and company trajectory
Perks & Benefits
- Competitive salary and equity in a rapidly growing startup
- Flexible vacation and paid time off (PTO)
- Health, dental, and vision insurance
- Catered meals (breakfast, lunch, & dinner)
- Commuter subsidies
- A collaborative and inclusive culture
About UsInception creates the world’s fastest, most efficient AI models. Today’s autoregressive LLMs generate tokens sequentially, which makes them painfully slow and expensive. Inception’s diffusion-based LLMs (dLLMs) generate answers in parallel. They are 5x faster and more efficient, while delivering best-in-class quality.
Inception was co-founded by Stanford professor Stefano Ermon, who co-invented such breakthrough AI technologies as diffusion models, flash attention, and DPO, UCLA professor Aditya Grover, who co-invented node2vec, decision transformers, and d1 reasoning, and Cornell professor and Afresh co-founder Volodymyr Kuleshov, who co-invented MDLM and Block Diffusion.
We pioneered the application of diffusion to language, with world’s first (and only) commercially available dLLM, Mercury. We are currently deploying our large-scale diffusion LLMs at Fortune 500 companies. Diffusion is the technology behind today’s image and video AI, and we’re making it the standard for LLMs as well.
Our team includes engineers from AWS, Google DeepMind, Meta AI, Microsoft, HashiCorp, and OpenAI. Based in Palo Alto, CA, we are backed by top-tier venture capitalists, including Menlo Ventures, Mayfield, M12 (Microsoft’s venture fund), Snowflake Ventures, Databricks, and Innovation Endeavors, and by tech luminaries such as Andrew Ng, Andrej Karpathy, and Eric Schmidt.
If you are talented, innovative, and ambitious, come help us invent the future of AI.We are an equal opportunity employer and encourage candidates of all backgrounds to apply.
Scan to Apply
Just scan this QR code to apply from your phone.
Job Location
San Mateo, California, 94401, United States