Research Engineer, Production Model Post Training

AnthropicSan Francisco, CA
266d$315,000 - $510,000

About The Position

Anthropic's production models undergo sophisticated post-training processes to enhance their capabilities, alignment, and safety. As a Research Engineer on our Post-Training team, you'll develop and optimize the systems that transform our base models into the refined Claude models that users interact with. You'll work at the intersection of cutting-edge research and production engineering, implementing, scaling, and improving post-training techniques like Constitutional AI, RLHF, and other alignment methodologies. Your work will directly impact the quality, safety, and capabilities of our production models.

Requirements

  • Strong software engineering skills with experience building complex ML systems
  • Comfortable working with large-scale distributed systems and high-performance computing
  • Experience with training, fine-tuning, or evaluating large language models
  • Ability to balance research exploration with engineering rigor and operational reliability
  • Adept at analyzing and debugging model training processes
  • Enjoy collaborating across research and engineering disciplines
  • Ability to navigate ambiguity and make progress in fast-moving research environments
  • Keen interest in AI safety and responsible deployment
  • Proficiency in Python, deep learning frameworks, and distributed computing

Nice To Haves

  • Experience with LLMs is a significant plus
  • Hands-on experience with frontier AI systems

Responsibilities

  • Implement and optimize post-training techniques at scale on frontier models
  • Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation
  • Develop tools to measure and improve model performance across various dimensions
  • Collaborate with research teams to translate emerging techniques into production-ready implementations
  • Debug complex issues in training pipelines and model behavior
  • Help establish best practices for reliable, reproducible model post-training

Benefits

  • Competitive compensation
  • Generous vacation and parental leave
  • Flexible working hours
  • Optional equity donation matching
  • Lovely office space for collaboration
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service