The Special Projects team at Apple is developing novel user-facing conversational features that leverage the multimodal capabilities of state-of-the-art foundation models. As part of this process, they generate real-world and simulated data, gather human data annotations, analyze the results, and use them to build and evaluate Large Language Model judges. The team is looking for a skilled Data Scientist to join their Machine Learning Evaluations teams. This person will work closely with ML Engineers to manage and analyze human and automated data annotation processes, and to develop, test, and refine LLM judges for generative AI model evaluation. A successful candidate is experienced in survey design, data annotation, LLM prompt engineering and prompt optimization, and has strong statistical analysis skills.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Career Level
Mid Level
Number of Employees
5,001-10,000 employees