The Agent Post-Training team at OpenAI is responsible for training frontier agents that are shipped to the world, including models for Codex, ChatGPT, and the API. These agents are designed to be persistent, proactive intelligences capable of operating computers, collaborating with people and other agents, and expanding human potential. The team defines future agent capabilities, develops the training signals for these abilities, and conducts experiments to realize them. Their work encompasses areas such as coding, tool use, computer operation, multi-agent coordination, long-horizon execution, factuality, instruction following, calibrated reasoning, and taste. This team is central to the creation of new model capabilities, building the data, environments, graders, training methods, and feedback loops that shape OpenAI's next generation of agents and bring them into production. As a member of the Agent Post-Training, Artifacts team, you will focus on training frontier models to produce polished and useful work products, including documents, spreadsheets, slide decks, dashboards, reports, analyses, and other interactive or editable artifacts. The role involves teaching models to transform vague user goals into finished artifacts with strong structure, visual appeal, domain judgment, correctness, and low latency. This requires owning improvements across the post-training stack, including Reinforcement Learning (RL), data pipelines, graders, reward signals, evaluations (evals), and behavioral analysis. You will collaborate with researchers, engineers, product teams, infrastructure teams, and safety/alignment partners to determine content for major model runs, measure their success, and implement improvements in products used by real people. This is a high-agency role for individuals who want their work to directly impact frontier models.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed