Data/ML Infrastructure Engineer

Matter IntelligenceSan Francisco, CA
6dOnsite

About The Position

We are seeking a Data Infrastructure Engineer to build and operate the infrastructure that turns drone, aerial, and orbital sensing data into production datasets, models, and customer-facing insights. This role spans ingestion, processing, storage, compute, and serving, with a strong emphasis on reliability, observability, performance, and cost. You will work closely with research and product engineering to shorten iteration cycles, improve reproducibility, and raise the quality bar for production systems. You will define clear interfaces and operational standards that keep the platform trustworthy as data volume, model complexity, and product usage scale.

Requirements

  • Meaningful experience building production data infrastructure, ML infrastructure, or distributed systems
  • Strong programming skills in Python and SQL, with the judgment to choose the right abstractions and interfaces for production systems
  • Experience building and operating systems on AWS
  • Familiarity with modern infrastructure and platform tooling, including Kubernetes, Docker, and Terraform
  • Experience working with production storage and serving systems such as Postgres and Redis
  • Familiarity with data and ML workflow tooling such as Metaflow
  • Strong instincts for observability, testing, and operational excellence

Nice To Haves

  • Experience supporting ML training, evaluation, batch inference, or model deployment in production
  • Familiarity with modern large-scale data patterns and tooling, including streaming, backfills, partitioning strategy, and schema evolution
  • Experience building internal platform primitives such as data versioning and lineage, dataset curation, experiment tracking, or tooling for reproducible workflows
  • Exposure to perception, multimodal, or geospatial systems, especially where data originates from real sensors and is used in real products

Responsibilities

  • Design, build, and operate scalable data and ML infrastructure on AWS, including workloads running on Kubernetes
  • Build and maintain systems for ingestion, processing, storage, and serving, with strong guarantees around data quality, correctness, and operational safety
  • Partner closely with research to support perception model training and evaluation workflows, enabling faster experimentation and more reproducible iteration
  • Build platform primitives for observability, data versioning, lineage, evaluation, reproducibility, and operational excellence
  • Partner with product engineering to ensure data- and model-derived insights are accessible through reliable, low-latency serving and retrieval interfaces
  • Design systems that enable efficient access patterns for customer-facing products, including search, indexing, and large-scale querying
  • Identify and address bottlenecks in throughput, cost, and operational complexity as the platform scales

Benefits

  • Competitive compensation based on experience
  • Early-stage equity package
  • 100% employer-paid health, dental, and vision coverage
  • Opportunity to work on novel sensing, data, and AI systems with real-world deployment paths across drone, aerial, and orbital platforms
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service