Principal Engineer – PCS Data Fabric

GE HealthCare
7d$188,000 - $282,000Hybrid

About The Position

Join us in building clinical grade data platforms that power meaningful insights across connected devices, diagnostics, and digital health. In this role, you will shape scalable data architecture on AWS that enables secure ingestion, governance, analytics, and responsible AI/ML. Your work will have a direct impact on improving patient outcomes and supporting clinicians, product teams, and partners around the world. We welcome applicants from all backgrounds—especially those historically underrepresented in tech—and encourage candidates who meet most, but not all, of the qualifications to apply. We value curiosity, collaboration, and a growth mindset.

Requirements

  • This role will need to work out of the central time zone.
  • 12+ years of experience in data or analytics platforms.
  • 6+ years leading AWS data architecture at scale.
  • Deep expertise with S3, Lake Formation, Glue, Athena, EMR, Redshift, Kinesis/MSK, and SageMaker.
  • Experience governing PHI and regulated ML workflows.

Nice To Haves

  • Experience with table formats such as Apache Iceberg, Delta Lake, or Hudi, and ACID‑on‑lake patterns.
  • Knowledge of CDC ingestion (DMS).
  • Familiarity with curated imaging pipelines (DICOM) and vector search for clinical text/notes.
  • FinOps practices for data platforms (tiering, compression, query optimization).

Responsibilities

  • Data Platform Architecture Design and evolve cloud‑native data platforms using S3, Lake Formation, Glue (catalog/ETL), Athena, EMR/EKS‑Spark, Redshift (including serverless), and Kinesis/MSK for streaming. Define lake and lakehouse patterns, real‑time and batch pipelines, and governed self‑service analytics capabilities.
  • Governance & Privacy Implement PHI tokenization/pseudonymization, fine‑grained access controls (column/row level), Macie discovery, encrypted storage (KMS), and data retention/lineage strategies using Glue and tags. Apply DLP and other privacy‑preserving controls aligned with HIPAA, GDPR, HITRUST, and FDA/ISO frameworks.
  • Interoperability Enable data exchange using FHIR, DICOM, HL7, and device telemetry through IoT Core into streaming and lake layers.
  • ML & MLOps Build governed ML workflows with SageMaker pipelines, model registry, lineage tracking, explainability, and bias reporting. Support dataset versioning and incorporate human‑in‑the‑loop processes when needed.
  • Self‑Service Data & Data Products Lead data mesh/product governance, enable Redshift/Athena consumption, support DataZone cataloging and access workflows, and utilize Clean Rooms for privacy‑preserving collaboration.
  • Reliability & Performance Architect for resiliency across multi-AZ/multi-region deployments, including S3 replication, lifecycle management, partitioning/compaction, and cost‑efficient performance tuning.
  • Validation & Auditability Maintain validation packages for regulated analytics and AI pipelines, including traceable lineage and CFR Part 11 evidence.

Benefits

  • A collaborative environment where diverse perspectives are valued.
  • Opportunities for ongoing learning, mentorship, and professional growth.
  • Flexibility, autonomy, and support from peers across engineering and product teams.
  • The chance to build solutions that have a real impact in healthcare.
  • GE HealthCare offers a competitive benefits package, including not but limited to medical, dental, vision, paid time off, a 401(k) plan with employee and company contribution opportunities, life, disability, and accident insurance, and tuition reimbursement.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service