JobTarget Logo

Senior Platform Engineer — AI Agent Infrastructure in Spain at Jobgether

NewJob Function: Information Technology
Jobgether
Spain, Spain
Posted on
New job! Apply early to increase your chances of getting hired.

Explore Related Opportunities

Job Description

Senior Platform Engineer AI Agent Infrastructure

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Platform Engineer — AI Agent Infrastructure in Spain.

This is a high-impact engineering opportunity to shape the infrastructure behind a rapidly growing AI agent platform operating at global scale. You will take ownership of core cloud architecture, ensuring systems remain reliable, observable, and ready for continuous expansion. The role combines deep infrastructure expertise with strong architectural thinking, making it ideal for someone who enjoys building scalable distributed systems rather than simply maintaining existing environments. You will lead decisions around messaging systems, automation, deployment pipelines, and platform resilience. Working in a remote and innovation-driven environment, you will collaborate with high-performing teams using modern cloud and AI technologies. This is an opportunity to directly influence the future of AI infrastructure in production environments.

Accountabilities:
  • Own and evolve the cloud infrastructure supporting AI agents running at scale in production environments
  • Design and implement event-driven architectures using durable asynchronous messaging systems
  • Improve inter-service communication by replacing synchronous dependencies with scalable messaging patterns
  • Build and maintain infrastructure as code frameworks for provisioning, deployment, and environment consistency
  • Ensure platform reliability, scalability, and performance across distributed workloads
  • Develop advanced observability capabilities including dashboards, alerts, tracing, logging, and health monitoring
  • Lead incident response analysis and proactively improve system resilience based on production learnings
  • Evaluate emerging technologies and drive architectural decisions as the platform matures
  • Optimize databases, storage systems, and caching layers for speed, availability, and cost efficiency
  • Collaborate with engineering teams to support secure and efficient deployment of AI workloads

Requirements:

  • 4+ years of experience in platform engineering, infrastructure engineering, SRE, or backend systems roles
  • Strong expertise in event-driven architecture and messaging systems such as Kafka, RabbitMQ, NATS, or similar
  • Deep AWS experience including EC2, VPC, IAM, S3, RDS, and internal networking concepts
  • Solid experience with SQL databases such as PostgreSQL and NoSQL systems such as MongoDB or Redis
  • Strong Docker knowledge including container lifecycle management, health checks, resource limits, and image optimization
  • Proven experience debugging distributed systems, asynchronous flows, and cascading production failures
  • Hands-on experience with Infrastructure as Code tools such as Terraform or Pulumi
  • Strong observability skills using Datadog or equivalent tools for APM, logging, monitoring, and tracing
  • Experience with Go or similar backend programming languages
  • Strong communication skills and ability to lead technical decisions in remote teams

Preferred Qualifications:

  • Experience supporting AI or MLOps infrastructure, model serving, LLM inference, or GPU workloads
  • Familiarity with LangFuse, LangSmith, Braintrust, MLflow, or similar AI observability tools
  • Experience building multi-tenant container platforms or internal PaaS environments
  • Kubernetes migration or production operations experience
  • Exposure to Airflow, Prefect, Snowflake, BigQuery, Databricks, or similar data platforms
  • ECS and AI agent framework ecosystem knowledge is a plus

Benefits:

  • Competitive compensation package
  • Fully remote work from anywhere
  • One-time home office setup allowance
  • Company-provided work equipment
  • Stock options
  • Health plan coverage regardless of location
  • Flexible paid time off
  • Language learning and professional development courses
  • Personal growth and continuous learning support
  • Opportunity to shape cutting-edge AI infrastructure at scale
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1

Job Location

Spain, Spain

Frequently asked questions about this position

Continue to apply
Enter your email to continue. You’ll be redirected to the employer’s application.
By clicking Continue, you understand and agree to JobTarget's Terms of Use and Privacy Policy.