Senior ML/AI Cheminformatics Scientist

PfizerGroton, CT
$93,600 - $156,000Hybrid

About The Position

As a Senior Cheminformatics Data Scientist within the Computational Absorption, Distribution, Metabolism, and Excretion (cADME) group in Pfizer’s Pharmacokinetics, Dynamics, and Metabolism (PDM) organization, you will apply data-driven approaches to address critical challenges in small-molecule drug discovery. In this role, you will develop, deploy, and continuously improve machine learning models for molecular property prediction and leverage Pfizer’s proprietary chemical and biological datasets to inform decision-making across hit-to-lead and lead optimization stages. Your work will directly enable more efficient compound prioritization, improved prediction of ADME properties, and accelerated progression of discovery programs. Success in this position will be defined by the delivery of robust, validated computational models and workflows that generate actionable insights and measurable impact on project outcomes. You will collaborate closely with cross-functional PDM partners and external collaborators to integrate AI-driven approaches into discovery workflows and maximize the value of Pfizer’s industry-leading data assets.

Requirements

  • PhD in a relevant field OR Relevant Master's degree with 2+ years of experience in Cheminformatics, Computational Chemistry, Bioinformatics, Data Science, or a related field.
  • Proficiency in Python, with experience with cheminformatics libraries and topics (RDKit, OpenEye, DeepChem, InChI, SMILES/SMARTS).
  • Experience working in a structured collaborative data and software development environment (git).
  • Experience using SAR model-building software such as SciKit-Learn, Torch, etc.
  • Strong understanding of basic statistical concepts and how to evaluate and track model performance.
  • Ability to perform mathematical calculations and ability to perform complex data analysis.
  • Permanent work authorization in the United States.

Nice To Haves

  • Experience using DAGs to define data pipelines, e.g., Luigi, Airflow, Dagster, Prefect, Kubeflow.
  • Experience working with relational (SQL) and NoSQL databases for chemical data management.
  • Experience working with MLOps tools such as MLFlow, Neptune, or SageMaker.

Responsibilities

  • Develop and apply cheminformatics data-mining algorithms for the analysis of molecular modeling, virtual screening, and structure-activity relationship (SAR) analysis.
  • Process and analyze chemical and biological datasets to evaluate properties such as drug-likeness, ADMET properties, dosing, etc.
  • Utilize molecular fingerprints, descriptors, and similarity metrics for compound clustering, classification, and regression.
  • Assist in development, maintenance, and monitoring of AI/ML models used for property prediction.
  • Optimize data pipelines for processing and storing chemical structures using cheminformatics toolkits (e.g., RDKit, OpenEye,).
  • Collaborate with cross-functional teams to integrate computational approaches into curation and analysis workflows.
  • Maintain best practices in data integrity, reproducibility, and documentation of QSAR AI/ML models.

Benefits

  • 401(k) plan with Pfizer Matching Contributions
  • Additional Pfizer Retirement Savings Contribution
  • Paid vacation, holiday and personal days
  • Paid caregiver/parental and medical leave
  • Health benefits to include medical, prescription drug, dental and vision coverage
  • Relocation support available
  • Relocation assistance may be available based on business needs and/or eligibility
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service