Research Scientist, Science of Post Training

Openai-posted 7 months ago

Full-time • Mid Level

Hybrid • San Francisco, CA

Professional, Scientific, and Technical Services

Resume

Match Score

Upload and Match ResumeTrack Jobs with Teal

The Post-Training team is responsible for training and improving pre-trained models to be deployed into ChatGPT, the API, and future products. The team partners closely with research and product teams across the company, and conducts research as a final step to prepare for real world deployment to millions of users, ensuring that our models are safe, efficient, and reliable. The Science of Post-training team is responsible for advancing the frontier of RLHF. We combine rigorous scientific experimentation with strong technical execution to drive progress in model alignment. Our goal is to develop insights that would make model training more robust and efficient. We contribute to core model deployments like GPT-4.1 and o3, but our main mandate is to pursue foundational research that will guide the trajectory of future model development.

Design and execute experiments to study the mechanics and efficacy of RL algorithms
Develop new theoretical frameworks to explain and predict behavior of post-training systems
Collaborate with cross-functional teams on the deployment and evaluation of safe, aligned models in production

Strong foundation in computer science, statistics, machine learning, physics, robotics, or a similarly rigorous theoretical and empirical discipline
Background in reinforcement learning research
Strong programming skills and comfort with low-level technical details

Excited to work at the intersection of scientific research and real-world deployment
Value rapid iteration via simple, well-executed experimentation

Relocation assistance to new employees
Hybrid work model of 3 days in the office per week

Track Jobs with Teal

Job Search Resources

•

Resume Builder

•

Research Scientist Resume Examples

•

Research Scientist Cover Letter Examples

Research Scientist, Science of Post Training

Job Search Resources

Tools

Career Hubs

Guides

Company