Anthropic is seeking Research Engineers to design and implement evaluations for its AI systems, with a focus on Claude. The role involves turning abstract concepts of intelligence into concrete metrics, designing and running evaluations across Claude's capabilities and personality, and building the infrastructure to run these evaluations at scale. The goal is to establish Anthropic as a leader in well-characterized AI systems whose performance is exhaustively measured and validated. The position requires close collaboration with researchers throughout the lifecycle of each new capability, from defining measurement criteria to interpreting results.
Job Type: Full-time
Career Level: Mid Level