OpenAI - posted 7 months ago
Full-time • Mid Level
Hybrid • San Francisco, CA
Professional, Scientific, and Technical Services

The Post-Training team is responsible for training and improving pre-trained models for deployment in ChatGPT, the API, and future products. The team partners closely with research and product teams across the company and conducts research as the final step in preparing models for real-world deployment to millions of users, ensuring that our models are safe, efficient, and reliable.

The Science of Post-Training team is responsible for advancing the frontier of RLHF. We combine rigorous scientific experimentation with strong technical execution to drive progress in model alignment. Our goal is to develop insights that make model training more robust and efficient. We contribute to core model deployments like GPT-4.1 and o3, but our main mandate is to pursue foundational research that will guide the trajectory of future model development.

  • Design and execute experiments to study the mechanics and efficacy of RL algorithms
  • Develop new theoretical frameworks to explain and predict behavior of post-training systems
  • Collaborate with cross-functional teams on the deployment and evaluation of safe, aligned models in production
  • Strong foundation in computer science, statistics, machine learning, physics, robotics, or a similarly rigorous theoretical and empirical discipline
  • Background in reinforcement learning research
  • Strong programming skills and comfort with low-level technical details
  • Excited to work at the intersection of scientific research and real-world deployment
  • Value rapid iteration via simple, well-executed experimentation
  • Relocation assistance for new employees
  • Hybrid work model of 3 days in the office per week