Software Engineer, Observability

OpenAISan Francisco, CA
1d

About The Position

We’re building the observability product for OpenAI—from scalable infrastructure to a rich, AI-powered UI. Our systems ingest over petabytes of logs and billions of time series metrics across our fleet. We're now layering intelligence on top—think agents that summarize SEVs, auto-generate dashboards, or help engineers debug through notebook-like UIs. We’re hiring software engineers across the stack—infra, backend, and product. You’ll join a small, gritty team building both foundational infra and novel internal tools to make OpenAI's production systems reliable, performant, and observable.

Requirements

  • Have operated large-scale distributed systems in production. ( especially logging systems or some other time series databases)
  • Thrive in ambiguous environments and roll up your sleeves to solve unscoped problems.
  • Have full-stack chops or product sensibilities—you're excited to build real tools people use.
  • Have strong fundamentals in systems, networking, and cloud infra (Kubernetes, AWS, etc).

Nice To Haves

  • built or contributed to observability systems (e.g. Prometheus, OpenTelemetry, etc).

Responsibilities

  • Own core observability infrastructure, including distributed logging, time series, and trace storage
  • Build AI-native tools that help engineers detect, understand, and resolve issues autonomously.
  • Contribute to UI experiences like dashboards, notebooking, or interactive debugging
  • Collaborate closely with engineers, researchers, user ops, and other teams across the company to build the next generation observability product
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service