Research Engineer, Production Model Post Training

Anthropic•San Francisco, CA

319d•$315,000 - $510,000

About The Position

Anthropic's production models undergo sophisticated post-training processes to enhance their capabilities, alignment, and safety. As a Research Engineer on our Post-Training team, you'll develop and optimize the systems that transform our base models into the refined Claude models that users interact with. You'll work at the intersection of cutting-edge research and production engineering, implementing, scaling, and improving post-training techniques like Constitutional AI, RLHF, and other alignment methodologies. Your work will directly impact the quality, safety, and capabilities of our production models.

Requirements

Strong software engineering skills with experience building complex ML systems
Comfort working with large-scale distributed systems and high-performance computing
Experience with training, fine-tuning, or evaluating large language models
Ability to balance research exploration with engineering rigor and operational reliability
Skill in analyzing and debugging model training processes
Enthusiasm for collaborating across research and engineering disciplines
Ability to navigate ambiguity and make progress in fast-moving research environments
Keen interest in AI safety and responsible deployment
Proficiency in Python, deep learning frameworks, and distributed computing

Nice To Haves

Experience with LLMs is a significant plus
Hands-on experience with frontier AI systems

Responsibilities

Implement and optimize post-training techniques at scale on frontier models
Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation
Develop tools to measure and improve model performance across various dimensions
Collaborate with research teams to translate emerging techniques into production-ready implementations
Debug complex issues in training pipelines and model behavior
Help establish best practices for reliable, reproducible model post-training

Benefits

Competitive compensation
Generous vacation and parental leave
Flexible working hours
Optional equity donation matching
Lovely office space for collaboration

What This Job Offers

Job Type: Full-time
Career Level: Mid Level
Industry: Publishing Industries
Education Level: Bachelor's degree
