Senior Data Engineer (AI Native) in United States at Jobgether
Explore Related Opportunities
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Data Engineer (AI Native) in the United States.
Join a fast-scaling, remote-first organization building data infrastructure that powers services used by tens of millions of people worldwide. In this role, you will design and operate large-scale, high-throughput data systems that process billions of daily events and transform them into reliable, analytics-ready datasets. You will work at the intersection of distributed systems, cloud engineering, and AI-assisted development, helping to shape how modern data platforms are built. The environment is highly collaborative and innovation-driven, with a strong emphasis on AI-native engineering practices and automation. You will partner closely with analytics, data science, and product teams to enable data-driven decision-making at scale. This role offers the opportunity to work on mission-critical pipelines while actively contributing to next-generation AI-enabled data tooling and workflows. Your work will directly improve platform reliability, performance, and the speed at which teams can access actionable insights.
- Design, build, and maintain scalable data platforms supporting real-time analytics, batch processing, and exploratory data use cases.
- Own end-to-end data pipelines, including ingestion, transformation, storage, and serving layers across large-scale distributed systems.
- Develop and optimize streaming and batch pipelines using technologies such as Kafka, Kinesis, Databricks, Spark, AWS, and Airflow (MWAA).
- Architect and maintain medallion (Bronze/Silver/Gold) data models to ensure clean, consistent, and well-governed datasets.
- Implement robust data quality frameworks, automated testing, monitoring, and CI/CD pipelines to ensure reliability and correctness.
- Build AI-powered engineering workflows, including prompts, automation scripts, and tooling that accelerate pipeline development and documentation.
- Develop and maintain natural language data interfaces and chatbot solutions (e.g., Databricks Genie) for non-technical users.
- Collaborate with analytics engineering, data science, and product teams to transform complex datasets into production-ready insights.
- Contribute to infrastructure-as-code initiatives (Terraform and related tools) for scalable cloud resource provisioning and management.
- 5+ years of experience in high-volume data engineering or distributed data systems.
- Strong expertise in Databricks, AWS (S3, EMR, Kinesis/Kafka), Apache Spark/Spark Streaming, Airflow, SQL, and Python (Scala or Java is a plus).
- Proven experience building and maintaining large-scale batch and streaming data pipelines in production environments.
- Solid understanding of data modeling (logical and physical), SQL optimization, and performance tuning.
- Hands-on experience with data quality and validation frameworks (e.g., Great Expectations or similar tools).
- Familiarity with Infrastructure-as-Code tools such as Terraform and cloud infrastructure best practices.
- Demonstrated ability to work effectively in AI-native environments using LLM tools for code generation, review, and workflow automation.
- Strong ability to evaluate, refine, and take ownership of AI-generated code and outputs before production deployment.
- Excellent communication skills with the ability to work independently in a remote-first environment.
- Bachelor’s degree in Computer Science, Engineering, Mathematics, or equivalent practical experience.
- Competitive salary: $103,500 – $192,000 USD depending on experience and location.
- Equity participation as part of the compensation package.
- Comprehensive health benefits including medical, dental, vision, life, and disability coverage (fully paid for US-based employees).
- Retirement plans including 401(k) with employer match.
- Employee Assistance Program (EAP) supporting mental health and wellbeing.
- Flexible PTO policy and company-wide paid time off days throughout the year.
- Learning and development programs to support continuous growth.
- Remote-first work environment with home office equipment and support.
- Access to premium membership benefits for family safety and coordination services.