Statistical Data Scientist

Praxis Precision Medicines, Inc.
$180,000 - $200,000
Remote

About The Position

Praxis is seeking an Associate Director, Statistical Data Scientist to lead and execute the development, validation and automation of analytical pipelines and statistical models that support metadata-driven clinical data processing, reporting, and regulatory submissions. This is a hands-on technical leadership role, ideal for a senior data scientist or statistical programmer who enjoys coding, problem-solving, and working cross-functionally to bring rigor, reproducibility, and automation to clinical reporting workflows. The Associate Director will contribute directly to R/Python development while also setting technical standards, mentoring peers, and ensuring readiness for R-based regulatory submissions.

Requirements

  • Bachelor’s or Master’s degree in Statistics, Biostatistics, Data Science, or a related field.
  • 8+ years of statistical programming experience in the pharmaceutical/biotech industry, including hands-on experience with R and/or Python.
  • Proven experience preparing or supporting R-based regulatory submissions (e.g., R package validation, R-based analysis delivery, or submission readiness)
  • Strong understanding of CDISC ADaM and SDTM data structures, and their use in analytical workflows
  • Experience developing and validating reusable R/Python libraries and functions
  • Proficiency with Git, Bitbucket, and CI/CD automation pipelines
  • Working knowledge of GxP and Part 11 compliance
  • Excellent documentation and validation practices
  • Collaborative and proactive mindset; able to operate independently in a small, agile team.

Nice To Haves

  • Familiarity with YAML/JSON configuration and metadata-driven programming workflows
  • Prior experience migrating from SAS to R/Python environments
  • Knowledge of R validation frameworks (e.g., risk-based testing, reproducibility documentation).
  • Experience with exploratory analytics or visualization in R or Python within a regulated framework.

Responsibilities

  • Lead the design, development, and validation of R/Python code to automate generation of analytical datasets and TLFs within a metadata-driven pipeline.
  • Translate SAPs and metadata specifications (YAML/CSV) into executable and reproducible code.
  • Build and validate R packages and data science tools supporting both exploratory and confirmatory analyses, ensuring full traceability and audit readiness.
  • Implement and validate statistical models (e.g., MMRM, ANCOVA, logistic regression) using R packages such as mmrm and emmeans.
  • Collaborate with IT to integrate data science and statistical programming workflows within Databricks and CI/CD pipelines for continuous validation and reproducibility
  • Collaborate across programming, biostatistics, and data standards functions to ensure dataset definitions, derivations, and metadata align with controlled standards.
  • Conduct peer code reviews, unit testing, and automated validation, ensuring deliverables meet submission-quality and reproducibility standards
  • Mentor and guide team members in best practices for programming, validation, and automation

Benefits

  • 99% of the premium paid for medical, dental and vision plans.
  • company-paid life insurance, AD&D, disability benefits, and voluntary plans to personalize your coverage
  • dollar-for-dollar match up to 6% on eligible 401(k) contributions, plus long-term stock incentives and ESPP
  • discretionary quarterly bonus
  • extremely flexible wellness benefit
  • generous PTO
  • paid holidays and company-wide shutdowns