Anthropic's RL Data team builds the systems that produce high-quality reinforcement learning data for Claude. This includes data collection pipelines, human feedback tooling, execution environments for RL tasks, and quality assurance systems to ensure trustworthy training data at scale. The team's goal is to enhance Claude's capabilities for complex, real-world tasks, with a focus on AI safety research and beneficial AI deployments. This is a foundational role on a new team, offering the opportunity to shape technical direction and initial projects. The work is hands-on and varied, involving pipeline and infrastructure engineering, prompt tuning, and supporting research teams. The role requires engineers who are willing to go beyond core engineering tasks, including reading transcripts, supporting users, and managing vendors.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level