Staff Applied Researcher, AI Quality in United States at Jobgether
Explore Related Opportunities
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Staff Applied Researcher, AI Quality in the United States.
This is an exceptional opportunity for an experienced AI researcher to shape the future of intelligent developer tools and large-scale AI systems. The role focuses on designing advanced evaluation frameworks and quality methodologies that directly influence how developers interact with AI-powered coding experiences. Working at the intersection of applied research, machine learning engineering, and product innovation, the successful candidate will help drive improvements in code generation, reasoning systems, safety, and agentic workflows. This position offers a highly collaborative and fast-paced environment where research ideas are rapidly translated into production-ready solutions. The role also provides the opportunity to mentor technical teams, influence long-term AI strategy, and contribute to cutting-edge advancements in LLM evaluation and experimentation. Candidates passionate about AI quality, scalable systems, and impactful research will thrive in this remote-first environment.
- Design and implement advanced evaluation frameworks for large language models, including code generation, reasoning, multimodal capabilities, safety, and agentic workflows.
- Develop scalable evaluation methodologies such as automated metrics, reward models, LLM-judge systems, and human-in-the-loop evaluation pipelines.
- Build and optimize benchmarking systems, datasets, experimentation pipelines, and production-grade ML evaluation tooling.
- Collaborate closely with engineering, product, and design teams to integrate research findings into practical AI-powered applications and product experiences.
- Lead initiatives focused on improving model quality, alignment, and performance across AI systems and developer tools.
- Drive the onboarding and creation of challenging benchmarks for coding agents and advanced AI workflows.
- Mentor researchers and engineers, promoting high technical standards, innovation, and effective execution practices.
- Provide strategic guidance in ambiguous problem spaces and contribute to long-term AI quality and evaluation strategies.
- Bachelor’s, Master’s, or PhD degree in Computer Science, Data Science, Mathematics, Statistics, Physics, Economics, Operations Research, or a related technical field, or equivalent practical experience.
- Minimum of 4–8 years of experience in data science, machine learning, applied research, or related technical fields depending on educational background.
- Strong software engineering expertise in Python and/or TypeScript, with experience building scalable ML, data, or evaluation pipelines in production environments.
- Proven experience delivering research systems or AI evaluation frameworks in real-world production settings.
- Deep understanding of large language model evaluation, alignment, reward modeling, safety assessments, or AI quality methodologies.
- Experience with large-scale experimentation, benchmarking strategies, and online/offline model evaluation techniques.
- Strong communication and cross-functional collaboration skills, with the ability to influence technical and product decisions.
- Experience with developer tools, AI-assisted programming, or code generation systems is highly preferred.
- Open-source contributions or experience engaging with developer communities is considered a strong advantage.
- Competitive base salary ranging from $140,400 to $372,300 USD annually.
- Eligibility for annual bonus programs and equity or stock-based compensation opportunities.
- Fully remote work environment with flexibility to work from anywhere within the United States.
- Comprehensive healthcare, dental, and wellness benefits.
- Generous paid time off and work-life balance support.
- Professional learning, career development, and growth opportunities.
- Access to cutting-edge AI research initiatives and high-impact technical projects.
- Inclusive, collaborative, and innovation-driven company culture.
- Opportunities to work alongside world-class engineers, researchers, and product leaders.
- Supportive environment focused on diversity, inclusion, and employee well-being.