Can I apply directly for this job on this page?

Yes, you can begin your application on this page using a quick form. You'll then be redirected to the employer's career site to complete the full application process.

Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab job near me in Canada Creek, Nova Scotia at Jobgether | Jobs and Employment

Research Lead / Principal Scientist & Manager Post-Training Alignment Reinforcement Learning Autodesk AI Lab

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Research Lead / Principal Scientist & Manager – Post-Training · Alignment · Reinforcement Learning in Canada.

This role sits at the forefront of frontier AI research, focusing on transforming foundation models into reliable, aligned, and domain-ready systems. You will lead a growing team of AI scientists while remaining deeply hands-on in research, shaping post-training strategies that include reinforcement learning, preference optimization, and agentic reasoning systems. Operating within a highly advanced AI research environment, you will influence both long-term research direction and real-world product impact across industries such as architecture, engineering, manufacturing, and media. The position blends scientific leadership with execution, requiring strong judgment in model behavior, evaluation design, and alignment trade-offs. You will collaborate closely with infrastructure, product, and research teams to ensure scalable and reproducible training workflows. This is a high-impact leadership role where your work directly contributes to advancing trustworthy AI systems used in real-world professional workflows.

Accountabilities

You will define and lead the post-training and alignment research strategy, overseeing how foundation models are refined into robust, safe, and high-performing systems. You will guide both technical direction and team execution while staying actively engaged in experimentation and algorithm development.

Lead post-training strategy across RLHF, preference optimization, and reinforcement learning for complex reasoning systems
Develop novel algorithms to improve model alignment, controllability, reliability, and domain-specific performance
Design and execute experiments to evaluate model behavior, robustness, reasoning quality, and safety
Establish evaluation frameworks for long-horizon reasoning, agentic behavior, and real-world workflow completion
Define model readiness criteria and provide go/no-go recommendations for deployment
Manage, mentor, and grow a team of AI researchers while fostering a high-rigor scientific culture
Collaborate with infrastructure and product teams to build scalable and reproducible training systems
Contribute to publications, patents, and external research visibility in top-tier ML venues
Translate technical findings into clear guidance for leadership and cross-functional stakeholders

Requirements

This role requires deep expertise in reinforcement learning and foundation model post-training, combined with proven research leadership experience. You should bring strong intuition for model behavior, alignment challenges, and large-scale AI system trade-offs.

Extensive hands-on experience with reinforcement learning and post-training methods (RLHF, RLAIF, PPO, DPO, or similar)
Proven experience leading or mentoring AI research teams in industry or academic settings
Strong understanding of alignment challenges, model evaluation, and reasoning systems
Experience designing rigorous evaluation frameworks for AI model performance and readiness
Ability to communicate complex technical concepts and trade-offs to diverse audiences
Background in ML, AI, or RL research, typically supported by a PhD or equivalent industry research experience
Preferred experience in frontier AI labs, agentic AI, or alignment research
Familiarity with large-scale training infrastructure and production AI systems is an asset
Strong publication record in top ML venues is highly valued

Benefits

Competitive compensation package including base salary, bonus, and equity components
Comprehensive health, dental, and vision coverage
Inclusive and collaborative research environment focused on real-world impact
Flexible work arrangements, including remote options across North America and Europe
Strong emphasis on research freedom, publication, and external visibility
Professional development opportunities and access to cutting-edge AI infrastructure
Paid time off, wellness programs, and employee support initiatives
Opportunity to influence frontier AI systems used in high-impact industrial domains

How Jobgether works:

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

#LI-CL1

Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab in Canada Creek, Nova Scotia at Jobgether

Explore Related Opportunities

Job Description

Scan to Apply

Job Location

Frequently asked questions about this position

Similar Jobs In Canada Creek, Nova Scotia

AI Research Engineer - Reinforcement Learning

Senior Applied Scientist, Search & Information Retrieval