AI Data Engineer--Peptides and Biologics

PfizerBerlin, MA
1d$106,000 - $171,500Hybrid

About The Position

Our cross-functional Data Ecosystem Team is looking to identify a forward-deployed data and AI engineer to help build and scale an AI-ready data architecture supporting biologics labs. You will leverage your expertise to design innovative software solutions that extract valuable insights from Pfizer's proprietary data and external datasets, enabling the generation of testable hypotheses across the entire drug discovery value chain.

Requirements

  • PhD in Biology, Chemistry, Physics, Statistics or a related technical discipline OR Master’s degree and 2+ years of experience building AI powered research applications
  • Strong background in data handling, integration and analysis
  • Thorough understanding of drug discovery and biology with a particular focus on large molecule therapeutics such as peptides, siRNA, antisense, mRNA and antibodies.
  • Research experience developing data products and data integration solutions as well as a sincere interest for computational life sciences
  • Experience solving complex analyses/problems in a timely fashion
  • Exceptional programming skills in Python
  • Strong experience as a full-stack developer with focus on python, in-depth database expertise with a focus on postgres, ETL frameworks.
  • Strong communication skills—verbal, written and presentation

Nice To Haves

  • Nextflow pipeline development
  • Proficiency in front-end technologies such as typescript, reactjs and browser-based visualization techniques is a plus
  • Proficiency in utilizing AI/ML libraries including PyTorch and Lightning
  • Experience with LLMs/RAG systems
  • Expertise in software engineering, package development, cloud architectures, CI/CD and software engineering tooling
  • Familiarity with pertinent libraries within the Python scientific stack
  • Hands-on experience handling, processing, integrating, and analyzing large heterogenous data sets data in a drug discovery research environment
  • Experience with Claude Code or equivalent and vibe coding paradigms
  • Strong publication record and demonstrated contributions to the field
  • Experience taking ideas from prototype to production.

Responsibilities

  • Development, support and implementation of a modern data platform to enable efficient and scalable correlation and analysis of data for biological drug modalities.
  • Development of innovative data products and machine learning methods for biologics data together with machine learning experts within Pfizer
  • Processing, analysis and integration of internal in vivo pharmacodynamics and toxicology data sets
  • Curation and integration of relevant datasets from the public domain
  • Development of analysis pipelines
  • Development and roll out of data products to meet specific needs through data integration
  • Implementation, testing and validation of new methods for data analysis and visualization techniques
  • Drive collaborations with external companies and academic institutions
  • Develop Pfizer biologics data capture, metadata tagging and storage strategy along with Pfizer’s Digital organization
  • Onboarding of Pfizer colleagues to the data platform and organization of workshops, hackathons, trainings and scientific talks
  • Strengthen external visibility and scientific excellence through publishing / presenting work in reputed journals and conference/workshop venues and engaging with the scientific community

Benefits

  • a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution
  • paid vacation, holiday and personal days
  • paid caregiver/parental and medical leave
  • health benefits to include medical, prescription drug, dental and vision coverage
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service