The Agent Post-Training team at OpenAI is responsible for creating the frontier agents that OpenAI ships to the world. This includes training the models behind products like Codex, ChatGPT, and the API, focusing on developing persistent, proactive intelligence capable of operating computers, collaborating with humans and other agents, and expanding human potential. The team defines future agent capabilities, builds the training signals, and conducts experiments to realize these advancements. Their work encompasses areas such as coding, tool use, computer operation, multi-agent coordination, long-horizon execution, factuality, instruction following, calibrated reasoning, and taste. This team is where new model capabilities are developed, involving the creation of data, environments, graders, training methods, and feedback loops that shape OpenAI's next generation of agents, which are then carried through major training runs and into user-facing products. As a Context Researcher on the Agent Post-Training team, the primary focus is on scaling the compute spent on context, which is believed to be the final enabler for AGI. This role involves working within OpenAI's frontier training stack to enable the next paradigm of model training, with a clear product interface for iterative deployment (Codex Chronicle). The researcher will collaborate with various teams, including researchers, engineers, product teams, infrastructure teams, and safety/alignment partners, to determine the content of major model runs, measure their success, and implement improvements into widely used products. This is a high-agency position for individuals who want their work to directly influence frontier models.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed