Data Engineer in Orlando, Florida at VAXCARE LLC
Explore Related Opportunities
Job Description
Are you looking for a data engineering role that can meaningfully impact the physical health of the United States' population for the next 20 years? Do you want to write software that transforms how healthcare vaccinates? VaxCare is a vaccine dispensing platform that leverages proprietary technology to improve immunization rates and overall vaccine program profitability. We are problem solvers at heart and are constantly looking to develop innovative tools and solutions that help accomplish our vision of every person fully vaccinated.
THE POSITIONYou'll be a key member of VaxCare's Product Group, joining our Data Engineering team and reporting to our Data Engineering Lead. We are seeking a motivated and capable Data Engineer to join our team. As a Data Engineer, you will contribute to the design, development, and management of our data processing and analytics infrastructure. The ideal candidate will have hands-on experience working with Spark and Databricks, a solid foundation in data engineering principles, and a desire to grow into a senior technical contributor.
RESPONSIBILITIESEducation:
- Bachelor's degree in Computer Science, Data Engineering, Engineering, or related technical field OR equivalent practical experience
- Master's degree or relevant industry certifications (Databricks Certified Data Engineer Associate, Azure Data certifications) are a plus
Experience:
- Must be located in the Greater Orlando / Boston area.
- 3-5 years of data engineering experience with 1+ years hands-on production experience building data pipelines on Databricks and Apache Spark
- Experience contributing to lakehouse architecture implementations
Technical Skills:
Programming & Languages:
- Strong proficiency in Python (PySpark, pandas) and SQL (complex queries, window functions, CTEs, query optimization)
- Experience with Spark SQL, Delta Lake SQL, and Databricks SQL
Apache Spark Expertise:
- Working knowledge of Apache Spark including:
- Performance fundamentals (partitioning, broadcast joins, data skew handling, caching strategies)
- Delta Lake features (ACID transactions, time travel, MERGE operations, CDC, liquid clustering)
Databricks Platform:
- Hands-on experience with Databricks including:
- Delta Live Tables (DLT) for declarative pipeline development
- Unity Catalog for data governance, access control, and lineage tracking
- Databricks Workflows and orchestration
- Basic understanding of cluster configuration and cost-aware compute selection
- Databricks SQL and Lakeview dashboards
Data Architecture & Modeling:
- Solid understanding of data modeling techniques:
- Dimensional modeling (star schema, fact/dimension tables)
- Medallion architecture (bronze/silver/gold layers)
- Slowly Changing Dimensions (SCD) implementations
- Strong SQL skills including query optimization and performance tuning
- Familiarity with modern lakehouse patterns and understanding of lakehouse vs. traditional data warehouse trade-offs
DevOps & DataOps:
- Familiarity with DevOps/DataOps practices:
- Git workflows (branching strategies, pull requests, code reviews)
- CI/CD pipelines for data workflows (GitHub Actions, Azure DevOps, Jenkins)
- Testing strategies (unit tests, integration tests, data quality tests)
- Basic monitoring and observability (logging, alerting)
Collaboration & Growth:
- Works independently to deliver high-quality, well-tested solutions with meaningful impact on the team's data infrastructure
- Takes ownership of assigned projects and drives them to completion with minimal oversight
- Strong communication and collaboration skills in cross-functional team environments
- Proactive in identifying problems and proposing solutions, even outside immediate area of responsibility
- Demonstrates initiative in expanding technical depth and breadth, with a trajectory toward senior-level engineering
- Open to feedback and committed to continuous improvement