Principal Scientist, Data Science – R&D DSDH - Therapeutics Development & Supply (TDS)

Johnson & Johnson
Spring House, PA
$117,000 - $201,250

About The Position

Johnson & Johnson Innovative Medicine is recruiting for a Principal Scientist, Data Science – R&D DSDH - Therapeutics Development & Supply (TDS). The primary location for this position is open to Spring House, PA; Malvern, PA; Cambridge, MA; Beerse, Belgium; Madrid, Spain; or Barcelona, Spain. Candidates interested in our US-based locations, please apply to: R-069212.

J&J Innovative Medicine develops treatments that improve the health of people worldwide. Research and development areas encompass oncology, immunology, neuroscience, cardiopulmonary, and specialty ophthalmology. Our goal is to help people live longer, healthier lives. We have produced and marketed many first-in-class prescription medications and are poised to serve the broad needs of the healthcare market, from patients to practitioners and from clinics to hospitals. To learn more about Johnson & Johnson Innovative Medicine, visit https://innovativemedicine.jnj.com/

Position Summary

The R&D Data Science organization is seeking a Data Scientist – Data Engineer to design, build, and optimize data capture, processing, and storage solutions that enable advanced analytics, digital process transformation, and AI/ML applications across the development‑to‑supply continuum for Therapeutics Development & Supply (TDS). You will be a hands‑on technical contributor working across Process Development, Manufacturing, Supply Chain, Quality, and Digital/Data Science teams to deliver high‑quality, AI‑ready data pipelines and data products. This role involves creating robust, future‑proof data systems, engineering workflows, and high‑value data repositories that support scientific, technical, and operational decision‑making.

Requirements

  • Degree in Engineering, Data Science, Life Sciences, Computer Science, or related field; advanced degree preferred.
  • 3+ years of experience in data engineering, including data modeling and database design, preferably in a scientific, manufacturing, or healthcare environment.
  • Proficiency with Python, R, SQL, and cloud-based architectures (e.g., AWS services, Snowflake, Redshift).
  • Experience with NoSQL and graph databases.
  • Strong analytical, problem‑solving, and stakeholder‑management skills, with the ability to translate discussions into actionable requirements.
  • Ability to drive multiple exciting projects simultaneously, with strong organizational skills and adaptability.

Nice To Haves

  • Experience with regulated or standards‑driven data environments, such as CDISC, HL7, FHIR, OMOP, DICOM, or manufacturing/quality data standards.
  • Familiarity with high‑dimensional data (e.g., imaging, sensor data).
  • Experience with data workflows that connect to or feed MLOps and model deployment workflows.
  • Knowledge of manufacturing systems (MES), laboratory information systems, or industrial data systems.
  • Exposure to knowledge graph or ontology‑driven architectures.

Responsibilities

  • Data Engineering & Pipeline Development: Design, build, and maintain scalable data pipelines for acquiring, integrating, and managing TDS data from diverse data generation sources and systems (e.g., lab systems, MES, clinical supply, quality systems, external partners).
  • Create and optimize data flows for structured and unstructured data using Python, R, SQL, cloud services, and other modern engineering tools.
  • Develop and maintain TDS‑specific data repositories, implementing enterprise‑level data models and creating new models as needed.
  • Enable AI/ML readiness by ensuring data is well‑structured, versioned, traceable, and semantically aligned with enterprise data standards.
  • Data Product & Architecture Partnership: Partner with data scientists, TDS domain experts, and digital technology teams to translate business needs into high‑quality data products and engineering requirements.
  • Work closely with ontology/knowledge graph teams to implement semantic models and future‑proof data architectures.
  • Quality, Compliance & Performance: Implement data quality and performance standards; define KPIs to measure accuracy, completeness, and consistency across TDS data assets.
  • Apply data versioning and lineage tracking for compliance, traceability, and audit readiness.
  • Follow software development best practices including code versioning, DevOps integration, and documentation.
  • Cross‑Functional Collaboration: Engage with scientific, technical, and operations stakeholders to understand requirements, design data solutions, and drive adoption.
  • Support multiple concurrent projects, managing priorities and delivering maximum business value across the TDS network.

Benefits

  • Vacation – 120 hours per calendar year
  • Sick time – 40 hours per calendar year; for employees who reside in the State of Colorado – 48 hours per calendar year; for employees who reside in the State of Washington – 56 hours per calendar year
  • Holiday pay, including Floating Holidays – 13 days per calendar year
  • Work, Personal and Family Time – up to 40 hours per calendar year
  • Parental Leave – 480 hours within one year of the birth/adoption/foster care of a child
  • Bereavement Leave – 240 hours for an immediate family member; 40 hours for an extended family member per calendar year
  • Caregiver Leave – 80 hours in a 52-week rolling period
  • Volunteer Leave – 32 hours per calendar year
  • Military Spouse Time-Off – 80 hours per calendar year

What This Job Offers

  • Job Type: Full-time
  • Career Level: Principal
  • Number of Employees: 5,001-10,000
