Infrastructure Tech Lead

OmnifoldSan Francisco, CA
Onsite

About The Position

Omnifold trains custom AI models that help planners forecast the future. We are hiring our first infrastructure tech lead, who will own the systems that make everything else possible. What makes this job interesting: We train a unique model for each customer, which means model training and inference work differently here than at any other company. You’ll never get more reps building model training infrastructure! Our team has very fast iteration speed but needs robust monitoring to pick up signal on user patterns. This is especially important as our application interface for AI-driven forecasting is unique on the market.

Requirements

  • Experience with cloud computing (especially GPU workloads), CI/CD infrastructure-as-code.
  • We run on AWS
  • Familiarity with or interest in ML workflows
  • Security fundamentals: encryption, access controls, compliance basics
  • Python proficiency
  • Ideally ~10 years of experience, including startup experience, with at least 3 years in a tech lead role.
  • 5+ years in infrastructure, DevOps, or platform engineering roles
  • Must have a strong Computer Science background

Responsibilities

  • Deployment: Reliable processes for getting models and services into production
  • Security: Data isolation between customers, product security, infrastructure hardening (SOC2 compliance and beyond)
  • Cloud resource management: GPU allocation, instance sizing, cost optimization
  • Monitoring and logging: Visibility into what's running, what's failing, and why
  • Data and ML ops: ETL pipelines from varied customer data sources, model versioning and lifecycle management
  • Automated testing: Building the test infrastructure that lets us ship with confidence
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service