We're looking for a Senior Data Engineer to join the Scientific Data Intelligence (SDI) team at Formation Bio to help transform Real World Data (RWD)—spanning electronic health records, claims, and other longitudinal patient data sources—into structured, analytics-ready assets. In this role, you'll be partnering closely with our Data Science team not only to model and transform data, but also to actively analyze it: answering research questions, generating evidence, and supporting scientific decision-making across our drug portfolio. This position sits at the intersection of healthcare data engineering, real-world evidence analysis, and generative AI. While a strong foundation in building reliable, scalable pipelines is essential, you'll be equally expected to roll up your sleeves and work directly with the data—constructing cohorts, running analyses, and translating findings into actionable insights for scientific and business stakeholders. The ideal candidate is a hybrid of data engineer and applied scientist: someone who can build the infrastructure and then use it, with familiarity in RWD study design, GenAI fluency (e.g., LLM-based entity extraction, summarization, classification), and strong technical expertise with modern data tooling. You'll play a key role in shaping how real-world patient data becomes discoverable, structured, and impactful across the organization.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
1-10 employees