Senior Data Engineer

TranslucentNew York, NY
Hybrid

About The Position

Translucent is seeking a Senior Forward Deployed Engineer with a data engineering focus to join their team in New York. This role sits at the intersection of customer financial and clinical data and the agentic AI systems being built. The engineer will be responsible for building and maintaining data integration interfaces, pipelines, and semantic layers to make healthcare data understandable to Translucent's products. They will also collaborate with platform engineers to ensure the core data systems are efficient, accurate, and well-tested. This is a high-ownership role where the individual will work directly with customers to understand their financial and clinical data models and translate this into robust data systems.

Requirements

  • Strong SQL optimization experience in a columnar data warehouse (BigQuery, Snowflake, Redshift, or similar), or Databricks.
  • Strong Python skills, including hands-on experience with pandas (must have) and the broader scientific Python ecosystem.
  • Strong preference to use types where possible, we use pydantic liberally.
  • DBT experience preferred.
  • Strong grounding in the fundamentals of computation, logic, and algorithms.
  • Solid data modeling instincts: comfortable designing schemas, reasoning about normalization vs. denormalization, and modeling complex domain entities.
  • Experience with orchestration tools (e.g. Prefect, Airflow, Dagster, Fivetran, Airbyte) etc.
  • 3-5+ years of post-university industry experience

Nice To Haves

  • Hands-on experience with GCP (BigQuery, GCS, Spanner, IAM, etc.).
  • Prior exposure to healthcare data (claims, EHR, financial systems) or other regulated/complex domains.
  • Experience working directly with customers or stakeholders in a forward-deployed, integrations, or solutions engineering capacity.
  • Golang experience.

Responsibilities

  • Build and maintain abstracted data integration interfaces for healthcare data.
  • Capture and understand the source-of-truth financial and clinical record models used by our customers. Potentially go on-site with customers to capture business logic.
  • Optimize data ingestion and transformation pipelines for correctness, performance, and cost.
  • Build semantic understanding into our core data systems so they can power agentic AI workflows.
  • Partner with platform engineers to build and maintain the core data systems our products depend on.
  • Validate data pipeline correctness and build rigor into our integration test suites at every opportunity.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service