Data Engineer, Human Cohorts

CalicoSouth San Francisco, CA
$191,000 - $195,000Onsite

About The Position

Calico is seeking a Data Engineer to join our highly collaborative Engineering team and focus on developing high-performance research data infrastructure for large human cohorts. To succeed, you will need to be an enthusiastic team player, detail-oriented, extremely organized, and comfortable working on complex data, software, and scientific problems. In this position, you will be the engineering lead for data infrastructure to support our human biology teams. You will drive projects from requirements-gathering to production deployment, engineering high-performance data systems that integrate with our internal data systems and our internally-developed AI platform.

Requirements

  • BS/MS/PhD in Computer Science, Data Science, or a related technical field, or equivalent practical experience
  • 4+ years (for BS/MS) or 1-2 years (for PhD) of professional software or data engineering experience developing robust, production-grade, and high-performance R&D-focused information systems
  • Experience working with large-scale biological datasets
  • Fluency in Python and SQL with a strong grasp of software and data engineering principles (testing, modularity, design patterns, data modeling)
  • Demonstrated experience developing and deploying cloud-based applications on Google Cloud Platform (GCP) (preferred), AWS, or Azure
  • Strong experience with modern web frameworks and infrastructure, specifically FastAPI, React, Kubernetes, and Terraform
  • Proven ability to lead complex projects involving diverse stakeholders (e.g., ML engineers, computational biologists, bench scientists) from concept to production
  • Experience enforcing robust data governance policies and compliance with internal information security standards and best practices
  • Must be willing to work onsite at least four days per week

Responsibilities

  • End-to-End Project Ownership: Collaborate with data scientists and bench scientists to gather requirements, architect solutions, and deploy production-grade software that facilitates data movement, transformation, analysis, and visualization
  • Data Flow Architecture: Define and optimize data flows across the organization
  • Full-Stack Tool Development: Develop data systems and internal web applications (using React and Python) that allow stakeholders to review, visualize, and communicate complex scientific data
  • Mentorship & Leadership: Serve as a strong technical voice within a larger Engineering team; provide mentorship to junior engineers across Calico and help onboard future hires
  • Engineering Excellence: Champion best practices for infrastructure-as-code, CI/CD, and containerization while helping to set standards for data engineering at Calico

Benefits

  • two annual cash bonuses
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service