You will play a critical role in scaling up global evaluation of Apple AIML products, with the primary focus on next generation Siri and Apple Intelligence features. You will drive and scale our evaluation work to enable high-velocity development and shipping of Generative AI features globally, in every country and language where Apple AIML features are available. You will drive LLM-based evaluation as a product, delivering simulation-based evaluation of personalized user experiences, reflective of cultural and language diversity of our customers. The focus of this technical lead role is strategy and execution of high quality evaluation datasets grounded in “personas”, acquired and synthetically generated, by language and region. This role requires a combination of engineering experience working with GenAI and ML based products, an ability to drive scale, and a relentless drive for improving signal-to-noise ratio. This role’s success will be driven by building deep cross-functional partnerships and by embracing and leading breakthrough technologies.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level