As a Technical Program Manager for model evaluations, you'll own end-to-end coordination of our evaluation ecosystem, building a feedback loop that runs from shaping eval strategy during early model development through launch execution. You'll be the critical bridge between the Research, Product, Marketing, and Engineering teams. This role sits at the intersection of frontier AI research and product launches. Evals are an important part of how we measure whether our models meet the bar for capability, safety, and competitive positioning. Beyond launch coordination, you'll help scale our evals ecosystem, from early-stage model evals for RL environments, to the systems and infrastructure on which evals run, to the tooling that enables the whole pipeline. A strong TPM in this space can immediately reduce chaos during launches while also driving systemic improvements that compound over time.