Data Engineer in United States at Jobgether
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Engineer in the United States.
This role focuses on building robust, scalable, and high-performance data systems that enable advanced analytics and data-driven decision-making at global scale. You will design and maintain modern data pipelines that integrate large and complex datasets from multiple sources into reliable storage and processing systems. Working closely with data scientists, ML engineers, and software developers, you will ensure data quality, accessibility, and consistency across the entire data lifecycle. The position involves both backend engineering and data architecture, with a strong emphasis on Python-based development and cloud-ready systems. You will contribute to improving data reliability, performance, and governance in environments handling mission-critical information. This is a high-impact role where your work directly supports innovation in science, healthcare, and enterprise analytics.
Responsibilities
- Design, build, and maintain scalable ETL/ELT data pipelines across distributed and cloud-based environments.
- Integrate and transform data from multiple internal and external sources into data warehouses and data lakes.
- Develop and optimize backend data services using Python, ensuring performance, maintainability, and scalability.
- Collaborate with cross-functional teams including Data Scientists, ML Engineers, and Software Developers to support data-driven initiatives.
- Build and maintain database schemas, data models, and documentation to ensure consistency and usability.
- Implement data quality, governance, security, and compliance best practices across data systems.
- Monitor, troubleshoot, and improve data pipelines to ensure reliability, integrity, and performance.
Requirements
- Strong experience in Python backend development, including building scalable APIs and services (FastAPI, Django, or Flask).
- Solid experience in data engineering, including ETL/ELT pipeline design and operation.
- Advanced SQL skills with experience in data modeling, optimization, and database design.
- Experience integrating data from APIs, databases, and large-scale or streaming systems.
- Familiarity with cloud platforms such as AWS, Azure, or GCP is highly desirable.
- Working knowledge of distributed systems, data processing frameworks, or large-scale data architectures.
- Experience with Docker, CI/CD pipelines, and optionally Kubernetes in production environments.
- Strong collaboration skills and ability to work effectively in cross-functional, remote teams.
- Nice to have: exposure to bioinformatics, life sciences, or healthcare data environments (e.g., clinical trials, CDISC, OMOP).
Benefits
- Fully remote position with global team collaboration opportunities.
- Competitive compensation aligned with experience and technical expertise.
- Opportunity to work on high-impact projects in biotech, pharma, and enterprise analytics.
- Budget for professional development, certifications, and conferences.
- Access to modern engineering tools and environments (MacBook or ThinkPad with Linux setup).
- Strong culture of technical excellence, collaboration, and continuous learning.
- Exposure to cutting-edge data engineering challenges at large enterprise scale.