Can I apply directly for this job on this page?

Yes, you can begin your application on this page using a quick form. You'll then be redirected to the employer's career site to complete the full application process.

What type of employment is offered for this AI Research Engineer (Multi-Modal Reinforcement Learning) role?

Full-time or part-time position

What is the expected salary for this AI Research Engineer (Multi-Modal Reinforcement Learning) job?

Compensation will be discussed during the hiring process.

AI Research Engineer (Multi-Modal Reinforcement Learning) job near me in Spain, Other / Non-US at Jobgether

AI Research Engineer (Multi-Modal Reinforcement Learning)

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a AI Research Engineer (Multi-Modal Reinforcement Learning) in Spain.

This role sits at the intersection of cutting-edge AI research and large-scale system engineering, focusing on advancing multi-modal reinforcement learning across text, image, audio, and complex simulated environments. You will contribute to the design of next-generation intelligent systems capable of adaptive decision-making in real-world scenarios. Working in a highly research-driven, globally distributed environment, you will help build and scale reinforcement learning frameworks that power advanced multimodal models. Your work will directly influence model performance, training stability, and reward optimization strategies at scale. You will collaborate with top-tier researchers and engineers to push the boundaries of AI capabilities. The role combines deep theoretical research with hands-on system development and experimentation. It is ideal for someone passionate about foundational AI breakthroughs and real-world deployment impact.

Accountabilities

In this role, you will lead research and engineering efforts across multi-modal reinforcement learning systems while contributing to scalable AI infrastructure and experimentation frameworks. You will be responsible for advancing model performance and robustness through innovative algorithm design and rigorous evaluation practices.

Conduct research on reinforcement learning methods for multi-modal systems, including diffusion-based and autoregressive model approaches.
Design and build scalable RL infrastructure supporting distributed training and evaluation across complex multi-modal environments.
Develop reward modeling strategies to improve alignment, training stability, and mitigate failure modes such as reward hacking.
Create and curate simulation environments and datasets for training, benchmarking, and validating multi-modal RL models.
Design and execute evaluation protocols to measure performance improvements and ensure reproducibility across experiments.
Analyze model behavior across modalities, identifying bottlenecks in optimization, exploration, and cross-modal alignment.
Explore and develop next-generation RL paradigms to enhance learning from environment feedback and improve SOTA performance.
Publish research in leading AI conferences such as NeurIPS, ICML, ICLR, CVPR, and related venues.

Requirements

The ideal candidate has a strong academic and practical background in machine learning, reinforcement learning, and multi-modal AI systems, with a proven record of research excellence and scalable system development. You are comfortable working at the frontier of AI research while building production-grade experimentation pipelines.

Master’s degree in Computer Science or related field required; PhD preferred in ML, CV, NLP, or AI-related disciplines.
Strong publication record in top-tier AI conferences (NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV).
Proven experience in large-scale reinforcement learning experiments, particularly in multi-modal or vision-centric systems.
Deep understanding of reinforcement learning theory, optimization, and policy learning in high-dimensional environments.
Strong hands-on experience with PyTorch and deep learning frameworks for multimodal AI systems.
Experience building end-to-end RL pipelines including simulation, training, evaluation, and deployment.
Ability to address core RL challenges such as sample efficiency, exploration-exploitation trade-offs, and training stability.
Strong analytical and problem-solving skills with a research-driven, experimental mindset.

Benefits

Competitive compensation package aligned with top-tier AI research talent
Fully remote, global-first work environment
Opportunity to work on frontier AI research problems at scale
High-impact role influencing next-generation multimodal intelligence systems
Collaboration with leading researchers and engineers in AI and reinforcement learning
Access to large-scale experimentation infrastructure and research resources
Strong culture of innovation, autonomy, and research publication support

How Jobgether works:

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

#LI-CL1

AI Research Engineer (Multi-Modal Reinforcement Learning) in Spain at Jobgether

Explore Related Opportunities

Job Description

Scan to Apply

Job Location

Frequently asked questions about this position