About The Position

Apple's AIML Evaluation team is looking for a seasoned technical leader to head our Data Science and Insights team. The organization leads Evaluation for Apple Intelligence, Siri, and a large portfolio of other billion+ user-facing features in SWE. Successful candidates will have deep experience in traditional human evaluation methodology, logging, and A/B testing, in addition to hands-on experience building and deploying LLM-based autograders and rubrics, and using these tools to proactively drive improvements in models and agentic features. As the head of Data Science and Insights, you'll influence the direction of a wide variety of software features, models, and platforms, in close collaboration with teams across the company. Your experience will enable you to thoughtfully balance the various tradeoffs involved in creating successful features that meet Apple's high customer expectations for both quality and privacy.

Requirements

  • 10+ years of experience in data science and machine learning evaluation, including 6+ years leading large technical teams
  • Advanced degree in a quantitative field such as Statistics, Computer Science, Machine Learning, or similar
  • Demonstrated track record of running organizations of 50+ data scientists and/or machine learning engineers
  • Deep experience in human evaluation methodology, logging, and A/B testing for consumer-facing products at scale
  • Hands-on experience building and deploying LLM-based autograders and rubrics, and using them to drive proactive improvements in models and agentic features
  • Strong written and verbal communication skills, able to communicate effectively with engineers and senior leaders, including Senior Vice Presidents

Nice To Haves

  • Experience evaluating large consumer AI products such as conversational assistants, search systems, or agentic features
  • Experience with logging infrastructure and instrumentation for AI product quality measurement
  • Track record of growing senior leaders from within your organization and, where needed, recruiting senior data science and machine learning talent in competitive hiring markets
  • Familiarity with evaluation frameworks for agentic systems and tool use

Responsibilities

  • Influence the direction of a wide variety of software features, models, and platforms, in close collaboration with teams across the company.
  • Thoughtfully balance the various tradeoffs involved in creating successful features that meet Apple's high customer expectations for both quality and privacy.
  • Lead our Data Science and Insights team.
  • Lead Evaluation for Apple Intelligence, Siri, and a large portfolio of other billion+ user-facing features in SWE.
  • Build and deploy LLM-based autograders and rubrics.
  • Use LLM-based autograders and rubrics to proactively drive improvements in models and agentic features.