We are searching for Lead Data Engineer, Health Data Platforms to support future opportunities. The person filling this role will build, scale, and maintain national-scale data pipelines, integrations, and storage solutions that power laboratory, research, and healthcare operations. They will lead the modernization and harmonization of diverse clinical data, ensuring interoperability, data quality, and compliance across platforms. This position will be based in Bethesda, MD. Architect and modernize robust, disease-agnostic data acquisition and ingestion pipelines for large-scale, heterogeneous healthcare data (e.g., EHRs, claims, registries, geospatial data). Design, implement, and maintain ETL/ELT pipelines across cloud platforms (AWS, Azure, GCP). Design and maintain scalable, reliable, and flexible applications using TypeScript, NodeJS, Angular, and RESTful web services to support data workflows. Integrate data sources including ELN, LIMS, sample tracking software, web-based portals, and REST APIs. Maintain and enhance data harmonization pipelines such as OMOP, improve interoperability among data models including OMOP, PCORNet, and FHIR, and ensure consistency and alignment for critical data types to support master data integration and harmonization. Implement and manage data storage solutions (data lakes, warehouses) utilizing the appropriate partitioning, security, and lifecycle policies. Champion data quality and governance standards through the development of sophisticated data quality frameworks, dashboards, and feedback loops to ensure transparency in data completeness, consistency, and quality for partners and researchers. Optimize pipeline performance, reliability, and cost by establishing monitoring and alerting functions. Innovate with advanced technologies: integrate new data sources (e.g., national mortality data, CMS), link datasets, and build processes for novel data types (geospatial, environmental). Collaborate with informatics, bioinformatics, and platform teams to define data models and SLAs. Provide technical leadership and mentorship Translate scientific needs into technical solutions in an agile, mission-focused environment. Implement CI/CD for data workflows and infrastructure-as-code (e.g., Terraform, ARM, CloudFormation). Document data architectures, lineage, and standards.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees