Data Engineer

CDC Foundation
Remote

About The Position

The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements.

Working within the City of Worcester Division of Public Health Office of Data, Research, Epidemiology, and Innovation, the Data Engineer will focus on building and optimizing data pipelines, improving data integration across existing systems, and supporting standardized, reproducible data workflows. The role emphasizes infrastructure development, documentation, and sustainability rather than ad hoc analysis and reporting. In addition to technical data engineering responsibilities, this role requires the ability to accurately document workflow processes to support quality improvement efforts.

The Data Engineer will be hired by the CDC Foundation and assigned to the City of Worcester Division of Public Health Office of Data, Research, Epidemiology, and Innovation. This position is eligible for a fully remote work arrangement for U.S.-based candidates.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Data Science, Public Health Informatics or a related field.
  • Minimum 5 years of relevant professional experience.
  • Experience designing, building, and maintaining data pipelines in small- to mid-scale data environments.
  • Experience implementing data automations within existing frameworks, as opposed to writing one-off scripts.
  • Experience with engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review.
  • Knowledge of data warehousing concepts and tools.
  • Ability to work with incomplete or evolving data sources common in public sector and public health settings.
  • Familiarity with agile development methodologies, software design patterns, and best practices.
  • Strong analytical thinking and problem-solving abilities.
  • Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively.
  • Flexibility to adapt to evolving project requirements and priorities.
  • Outstanding interpersonal and teamwork skills, and the ability to develop productive working relationships with colleagues and partners.
  • Experience working in a virtual environment with remote partners and teams.
  • Proficiency in Microsoft Office Suite.

Nice To Haves

  • Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, or SQL.
  • Strong understanding of database systems, including relational databases and structured datasets.
  • Expertise in data modeling, ETL (Extract, Transform, Load) processes, and data integration techniques.
  • Experience with cloud computing platforms.
  • Ability to document business practices and workflows, identify opportunities for improvement, support process improvement, discover issues, and deliver improved value.
  • Ability to effectively communicate insights and plans to cross-functional team members and management.
  • Experience using data to make decisions and deliberately gathering data insights to improve outcomes.

Responsibilities

  • Design, build, and maintain scalable data pipelines to support data intake, transformation, and storage of public health data from multiple internal and external data sources.
  • Support integration of existing data systems (e.g., programmatic data) into standardized data environments.
  • Implement and maintain ETL/ELT processes that ensure data accuracy, completeness, consistency, and reproducibility.
  • Develop and maintain technical documentation, including data flow diagrams, data dictionaries, and pipeline documentation, to support long-term sustainability.
  • Implement basic data validation, monitoring, and error handling processes to identify and address data quality issues.
  • Support development of standardized data structures and naming conventions aligned with public health best practices and internal governance needs.
  • Collaborate closely with epidemiologists, program staff, and external partners to understand data requirements and translate public health needs into technical solutions.
  • Work in coordination with other CDC Foundation placements to ensure infrastructure supports in-scope project efforts such as analytic, visualization, and reporting needs.
  • Provide technical guidance and knowledge transfer to internal staff to support capacity building and continuity after the placement ends.
  • Implement appropriate data security and access controls consistent with public health data governance, privacy, and confidentiality requirements.
  • Apply industry best practices in data engineering while adapting solutions to the operational realities of a local public health department.
  • Stay informed of emerging tools and approaches relevant to public health data infrastructure and recommend improvements where appropriate.
  • Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.
  • Up to 10% domestic travel may be required.