Data Standardization Intern

Johnson & Johnson Innovative MedicineHopewell Township, NJ
1d$23 - $52

About The Position

We are recruiting a highly motivated and detail‑oriented Summer Intern to support strategic initiatives focused on data standardization, data connectivity, and the development of AI‑ready datasets. The intern will contribute to foundational work that enhances data interoperability, improves data quality, and accelerates the creation of enterprise‑scale analytics and AI solutions. This role provides an excellent opportunity to gain hands‑on experience in data engineering, metadata standards, and AI enablement within a scientific and R&D context.

Requirements

  • Currently pursuing a bachelor’s, master’s or Ph.D’s degree in Data Science, Computer Science, Information Systems, Bioinformatics, Engineering, or a related discipline.
  • Foundational knowledge of Python, SQL, or equivalent programming languages for data manipulation and analysis.
  • Understanding of core data concepts, including data models, schemas, metadata, ontologies, and data governance principles.
  • Demonstrated interest in AI systems, data engineering, or machine learning workflows.
  • Strong analytical and problem‑solving skills, with exceptional attention to detail.
  • Effective communication skills and the ability to work both independently and collaboratively.
  • Permanently authorized to work in the U.S., must not require sponsorship of an employment visa (e.g., H-1B or green card) at the time of application or in the future. Students currently on CPT, OPT, or STEM OPT usually requires future sponsorship for long term employment and do not meet the requirements for this program unless eligible for an alternative long-term status that does not require company sponsorship.

Nice To Haves

  • Familiarity with scientific or clinical data standards such as CDISC, FHIR, OMOP, internal ontologies, or FAIR data principles.
  • Exposure to modern cloud platforms (e.g., Azure, AWS) and data tooling.
  • Experience with workflow orchestration tools (e.g., Nextflow, Airflow) or scientific data pipelines.
  • Understanding of R&D, clinical, omics, or experimental data environments.

Responsibilities

  • Support ongoing data standardization efforts, including harmonization of data structures, formats, and terminologies across multiple scientific and operational data sources.
  • Assist in the development and enhancement of data connectivity frameworks that improve interoperability between platforms, pipelines, and analytical systems.
  • Contribute to the preparation of AI‑ready datasets by implementing best practices in schema management, metadata curation, lineage documentation, and quality assessment.
  • Conduct exploratory data analyses to evaluate data completeness, consistency, and harmonization needs.
  • Collaborate with cross‑functional partners—including Data Engineering, Data Governance, and AI/ML teams—to capture requirements and support delivery of standardized data assets.
  • Document workflows, data definitions, technical decisions, and process improvements to support organizational knowledge sharing and operational scalability.
  • Assist in prototyping or testing automation approaches for data validation, transformation, and standardization where appropriate.

Benefits

  • Co-Ops/Interns are eligible to participate in Company sponsored employee medical benefits in accordance with the terms of the plan.
  • Co-Ops and Interns are eligible for the following sick time benefits: up to 40 hours per calendar year; for employees who reside in the State of Washington, up to 56 hours per calendar year.
  • Co-Ops and Interns are eligible to participate in the Company’s consolidated retirement plan (pension).
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service