We are seeking talented engineers to join our team and push the boundaries of evaluations for Siri AI Agents. Evaluation lies at the heart of our model development strategy—it shapes architectural choices, guides launch decisions, and ultimately ensures a world-class user experience. Our team is highly innovative and fast-moving, leveraging auto-evaluators and LLM-based judges to measure, validate, and continuously improve the core Siri AI engine. If you’re excited by the challenge of building trusted evaluation systems that directly impact the quality of a groundbreaking AI product used by millions worldwide, this role is for you.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Education Level
Master's degree
Number of Employees
5,001-10,000 employees