Build systems that transform powerful pre-trained models into aligned, general-purpose agents. Drive research and engineering initiatives that push the frontier of post-training, from data curation to large-scale optimization. Develop data generation pipelines, reward models, reinforcement learning algorithms, and inference-time scaling techniques. Collaborate across pre-training and post-training teams to deliver step-function gains in model capability. Help shape our understanding of how large models learn to reason, follow instructions, and improve through reinforcement learning.
Job Type: Full-time
Career Level: Mid Level
Education Level: No Education Listed
Number of Employees: 51-100