About The Position

AI represents a big opportunity to elevate Apple’s products and experiences for billions of people globally. We are looking for Applied Research Scientists with a background and interest in Agentic Systems. You will be leveraging state-of-the-art Generative models to ship extraordinary products, services, and customer experiences for the iPhone, Mac, Apple Watch, iPad and more. The mission of Proactive Intelligence is to improve Apple platforms by better understanding, anticipating and adapting to user behavior by using machine learning to build phenomenal features that are built right into Apple platforms. Our team provides an opportunity to be part of an incredible research and engineering organization within Apple. The ideal candidate will have industry experience across a range of modeling problems relevant to LLM-powered search and agentic systems, including Training and Fine-Tuning Large Language Models (LLMs), Learning from Human Preferences, LLM-based Evaluation and Judging, Retrieval and Planning, and Causal Analysis of Model Failures. Working knowledge of large-scale data processing especially with structured data, probabilistic modeling and statistics will broaden your role and effectiveness in this position. This role contributes directly to Personal Search and Agentic Search inside Siri — the capabilities that let users reason over their own data and act on it through natural conversation.

Requirements

  • Strong programming skills in Python and/or C++ with 6+ years of experience in using these languages for machine learning (ML) modeling and applied research
  • M.S. or PhD in Computer Science, or a related fields such as Electrical Engineering, Robotics, Statistics, Applied Mathematics or equivalent experience.
  • A minimum of 6 years of experience in applied ML and/or product development.
  • Fundamental knowledge of ML concepts and hands-on experience in building deep-learning systems
  • Strong software engineering skills to create scalable and robust infrastructure for machine-learning data, modeling and evaluation systems
  • Proven ability to train and debug machine-learning systems: defining metrics and datasets, performing error analysis and training models in a modern ML framework

Nice To Haves

  • Familiarity with researching current ML literature and math including optimization methods and modeling techniques
  • Passionate about building extraordinary personal and agentic search experiences powered by Generative AI
  • Creative, collaborative and project focused with an ability to work hands-on in multi-functional teams
  • Experience designing LLM-as-Judge methodologies, ground-truth datasets, and diagnostic tooling for generative systems
  • Proficiency in using ML toolkits such as PyTorch, TensorFlow, SkLearn etc.

Responsibilities

  • Design and implement ML algorithms that process data in different Apple products.
  • Train and fine-tune generative models and agentic systems.
  • Design the evaluation methodology and ground-truth surfaces that measure their quality.
  • Build the causal-analysis tooling that explains why agents fail and how to fix them.
  • Integrate ML frameworks into our products and leverage cloud services for scalable training, evaluation, and ablation pipelines across locales.
  • Communicate advanced ideas to a focused team of researchers in the spirit of developing innovative tools and metrics that change the way we look at problems.
  • Work closely with other cross-functional teams to align messaging, contribute to roadmaps and contribute software back into different repos for proper integration with core systems.
  • Write clean, maintainable and production code with appropriate documentation and tests.
  • Contribute to architecture decisions, design reviews and peer code reviews.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service