Data Architect

Tempus AIBoston, MA
Remote

About The Position

Passionate about precision medicine and advancing the healthcare industry? Recent advancements in underlying technology have finally made it possible for AI to impact clinical care in a meaningful way. Tempus' proprietary platform connects an entire ecosystem of real-world evidence to deliver real-time, actionable insights to physicians, providing critical information about the right treatments for the right patients, at the right time. Role Overview We are seeking a Data Architect to design the structural backbone of a high-scale, multi-modal healthcare data ecosystem. You will be responsible for architecting data environments that serve massive networks of hospitals, ensuring that diverse data types—ranging from structured EHR records to unstructured genomic and imaging data—are seamlessly accessible to autonomous AI agents. Our goal is to move beyond static data warehousing to create a dynamic, "agent-ready" data fabric that supports real-time clinical evaluation at an enterprise scale.

Requirements

  • Bachelor’s Degree in Computer Science, Health Informatics, or a related field.
  • 7+ years in data architecture or enterprise modeling, with significant experience in the healthcare or life sciences domain.
  • Expert-level knowledge of 3NF, Dimensional (Star Schema), and Data Vault 2.0 modeling techniques.
  • Exceptional SQL skills for complex analytical environments and proficiency in Python for data profiling and debugging.
  • Ability to articulate the trade-offs between RDBMS, MPP, and NoSQL technologies, and experience implementing Master Data Management (MDM) solutions.
  • Deep familiarity with HL7, FHIR, and Epic/Cerner data structures.
  • Proficiency with modeling tools such as Erwin, Vertabelo, or Lucidchart.

Nice To Haves

  • Experience with Vector databases (e.g., Pinecone, Weaviate, or pgvector) or Graph databases to support RAG and agentic memory.
  • Hands-on experience with GCP (BigQuery, Vertex AI) or AWS healthcare-native services.
  • Direct experience working with EHR, OMOP, DICOM, genomic data models, or longitudinal patient records.

Responsibilities

  • Lead the design and management of an enterprise data model that integrates complex domains including clinical EHR records, high-throughput genomics (NGS), and cardiovascular imaging (Echo, Cath, ECG).
  • Architect data solutions designed to scale across federated networks of hospitals, ensuring multi-tenancy, high availability, and performance across hybrid cloud environments.
  • Design data access patterns and metadata layers specifically optimized for AI agents, allowing them to autonomously discover, query, and reason over structured and unstructured datasets.
  • Author and maintain entity-relationship diagrams (ERDs), data dictionaries, and API specifications across multiple technologies (Relational, NoSQL, Vector Databases).
  • Implement automated solutions to monitor data quality and lineage with strict traceability back to source systems, ensuring "ground truth" for agentic evaluations.
  • Educate engineering and clinical teams on data modeling standards, governance, and best practices for maintaining data integrity in a HIPAA-regulated environment.

Benefits

  • incentive compensation
  • restricted stock units
  • medical and other benefits depending on the position
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service