AI Data Engineer--In Vivo Data

PfizerBerlin, MA
4d$106,000 - $171,500Hybrid

About The Position

As a member of our cross-functional Data Ecosystem Team, you will help build and scale an AI-ready data architecture supporting In-Vivo biology labs. In this role, you will leverage your expertise to design innovative software solutions that extract valuable insights from Pfizer's proprietary data and external datasets, enabling the generation of testable hypotheses across the entire drug discovery value chain.

Requirements

  • PhD in Biology, Pharmacology, Toxicology, Computer Science, Physics, Statistics, or a related technical discipline OR Master’s degree and 2+ years of experience building AI powered research applications
  • Experience in In-Vivo Pharmacology
  • Strong background in data handling, integration and analysis
  • Thorough understanding of drug discovery and biology with a particular focus on in vivo / in vitro translational research.
  • Research experience in developing data products and data integration solutions
  • Experience solving complex analyses/problems in a timely fashion
  • Exceptional programming skills in Python
  • Strong full-stack development experience with focus on python, in-depth database expertise with a focus on postgres and ETL frameworks
  • Strong communication skills—verbal, written, and presentation

Nice To Haves

  • Nextflow pipeline development experience
  • Hands-on experience handling, processing, integrating, and analyzing large heterogenous data sets data in a drug discovery research environment
  • Proficiency in front-end technologies such as typescript, reactjs and browser-based visualization techniques
  • Proficiency utilizing AI/ML libraries including PyTorch and Lightning is a plus
  • Experience with LLMs/RAG systems
  • Proven expertise in software engineering, package development, cloud architectures, CI/CD and software engineering tooling
  • Familiarity with pertinent libraries within the Python scientific stack
  • Experience with Claude Code or equivalent and vibe coding paradigms
  • Strong publication record and demonstrated contributions to the field
  • Experience taking ideas from prototype to production

Responsibilities

  • Development and implementation of a data platform to enable efficient and scalable correlation and analysis of in vitro and in vivo data
  • Development of innovative data products and machine learning methods for data to support translational studies together with machine learning experts within Pfizer
  • Processing, analysis and integration of internal in vivo pharmacodynamics and toxicology data sets
  • Curation and integration of relevant datasets from the public domain
  • Development of data analysis pipelines
  • Development and roll out of data products engineered to meet specific data access patterns
  • Implementation, testing and validation of new methods for data analysis and visualization techniques
  • Drive collaborations with external companies and academic institutions
  • Develop Pfizer in vivo data capture, metadata tagging and storage strategy along with Pfizer’s Digital organization
  • Onboarding of Pfizer colleagues to the data platform and organization of workshops, hackathons, trainings and scientific talks
  • Strengthen external visibility and scientific excellence through publishing / presenting work in reputed journals and conference/workshop venues and engaging with the scientific community

Benefits

  • participation in Pfizer’s Global Performance Plan with a bonus target of 15.0% of the base salary and eligibility to participate in our share based long term incentive program
  • 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution
  • paid vacation, holiday and personal days
  • paid caregiver/parental and medical leave
  • health benefits to include medical, prescription drug, dental and vision coverage
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service