This role builds the evaluation infrastructure that answers questions about the effectiveness of AI safety systems. You'll sit at the intersection of applied ML research and engineering — designing experiments to measure how well an investigative agent performs across harm areas, building datasets that represent real abuse rather than synthetic benchmarks, and shipping those methods into pipelines that gate every change to the system. Your work directly determines how much trust Anthropic can place in its automated abuse detection, and where we invest to make it better.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior