Deep Learning Engineer, LLM Accuracy Evaluation at Jobgether – Spain
Explore Related Opportunities
About This Position
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Deep Learning Engineer, LLM Accuracy Evaluation in Spain.
Join a cutting-edge engineering team focused on advancing how next-generation AI models are evaluated and optimized. In this role, you will work at the intersection of deep learning research and scalable infrastructure, helping define new standards for assessing the accuracy and reliability of large language models, retrieval-augmented systems, and multimodal architectures. You will collaborate with global partners and open-source communities to bring high-performance AI models into production-ready environments. With access to powerful computing resources and emerging technologies, you will contribute directly to shaping the future of AI systems. This position offers a fast-paced, innovation-driven environment where experimentation, technical excellence, and impact go hand in hand.
- Lead the development of advanced methodologies to evaluate the performance, accuracy, and robustness of deep learning models, including LLMs, RAG systems, and vision models
- Collaborate with internal teams and external partners to optimize and deploy flagship AI models as high-performance inference services
- Design, build, and maintain scalable tools, pipelines, and infrastructure supporting AI evaluation and benchmarking initiatives
- Analyze and improve AI frameworks, libraries, and APIs to ensure alignment with best practices and performance standards
- Conduct experiments and research to validate new evaluation techniques and contribute to continuous model improvement
- Support cross-functional initiatives by translating complex technical findings into actionable insights
Requirements:
- Advanced degree (BS, MS, or PhD) in Computer Science, Artificial Intelligence, Applied Mathematics, or a related field, or equivalent experience
- Extensive hands-on experience (10+ years) in AI development, particularly in NLP and large language models
- Strong expertise in deep learning algorithms, mathematical modeling, and performance evaluation techniques
- Proven ability in debugging, testing, and optimizing large-scale AI systems
- Experience with inference and deployment technologies such as TensorRT, ONNX, or Triton is highly desirable
- Familiarity with MLOps/DevOps practices, containerization (Docker), and Linux-based environments
- Experience working with large-scale computing environments or HPC clusters is a plus
- Excellent communication skills and ability to collaborate effectively in a fast-paced, global environment
Benefits:
- Opportunity to work on state-of-the-art AI technologies with access to cutting-edge hardware and infrastructure
- Flexible and remote-friendly working environment within Spain
- Exposure to global collaborations with leading AI researchers and engineers
- Continuous learning and development opportunities in a rapidly evolving field
- Competitive compensation package with performance-based incentives
- Inclusive and diverse workplace culture that fosters innovation and creativity