Anthropic is seeking a Software Engineer to join their Safeguards Evals team. This role focuses on building and maintaining the evaluation infrastructure for an AI agent that investigates potential misuse of Claude. The engineer will design experiments, build datasets representing real abuse, and implement methods into pipelines that govern system changes. The work directly impacts the trust in automated abuse detection and guides improvements. The position is at the intersection of applied ML research and engineering.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
Associate degree