OpenAI is looking for exceptional researchers to join the Post-Training Frontiers team, which is responsible for post-training the agentic models shipped across Codex, the API, ChatGPT Thinking, and ChatGPT Pro. The team sets up the pipeline for deciding which integrations can go into the post-training run, develops its own horizontal improvements to the model, and trains the final model. The role requires working on the most impactful horizontal improvements for the next model, which could include factuality, instruction following, function calling, multi-agent collaboration, calibrated reasoning effort, tool use, or improving taste in models. This could involve building or improving the grading stack, improving the user-data flywheel, or automating processes to make large post-training runs faster, more reliable, and easier for researchers to use. This team is for individuals who want their work to directly impact models used by hundreds of millions of people. The ideal candidate is deeply technical, highly independent, goal-oriented rather than method-oriented, and excited by the high-agency work of turning research ideas into production model behavior.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed
Number of Employees
1-10 employees