Technical Program Manager, AI Evaluation Specialist at Jobgether – United States
Jobgether
United States
Job Function: Human Resources
About This Position
Technical Program Manager, AI Evaluation Specialist
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Technical Program Manager, AI Evaluation Specialist in the United States.
This role is well suited to a detail-oriented, analytical professional who is passionate about ensuring the safety, reliability, and quality of AI systems in operational settings. The Technical Program Manager, AI Evaluation Specialist will lead the human-in-the-loop evaluation process: measuring AI model performance, identifying errors, and recommending improvements to enhance accuracy and trust. You will collaborate closely with data, operations, and model teams to standardize evaluation protocols, analyze error patterns, and ensure AI outputs meet organizational standards. This position combines technical rigor with project management and offers the chance to influence model governance at scale. You will track metrics, maintain documentation, and help integrate insights into operational workflows. The role requires strong problem-solving, communication, and organizational skills, and operates in a collaborative, fast-paced environment.
Accountabilities:
- Own the human-in-the-loop evaluation process for AI models supporting operations, ensuring consistent and accurate assessments.
- Conduct recurring sampling and detailed reviews to assess model accuracy, consistency, and failure modes.
- Score, tag, and document instances where AI systems misclassify, hallucinate, or generate incomplete outputs.
- Maintain rubrics, guidelines, and documentation to ensure evaluator alignment and scoring consistency.
- Investigate error patterns and root causes, translating insights into actionable recommendations for model owners and partner teams.
- Track and report evaluation metrics, such as accuracy, recall, coverage, and error types, and integrate findings into dashboards and workflows.
- Support scaling of governance processes and strengthen model-health standards across operations.
Requirements:
- 3-5+ years of experience in QA, evaluation, operational analytics, human-in-the-loop programs, or model monitoring.
- Experience reviewing unstructured text and applying rubrics or scorecards for qualitative and quantitative assessment.
- Understanding of AI applications in operations, including classification, summarization, categorization, and automation.
- Strong analytical skills with the ability to identify patterns, edge cases, and failure modes.
- Familiarity with QA frameworks or content-review workflows.
- Experience with SQL, Looker, or Snowflake is a plus.
- Exceptional attention to detail and consistency in work.
- Clear communication and documentation skills.
- Passion for ensuring AI systems are safe, fair, and reliable.
- COPC or Lean Six Sigma experience is a plus.
Benefits:
- Competitive base salary ranging from $103,680 to $144,000 annually, plus potential bonus and equity opportunities.
- Comprehensive medical, dental, vision, life, and disability insurance.
- 401(k) retirement plan with company match.
- Flexible vacation and paid time off policies.
- Paid parental leave for birthing and non-birthing parents.
- Wellness stipends and support for family planning services.
- Opportunities for both in-person and virtual team engagement and professional development activities.
- Remote-first work environment with occasional on-site collaboration as needed.
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.