Principal Engineer, Search and AI Infrastructure, Machine Learning Platform & Infrastructure

Apple•Seattle, WA

About The Position

As part of Apple Services Engineering organization, the Machine Learning Platform & Infrastructure team is building groundbreaking technology for search, natural language processing, artificial intelligence and machine learning. Our infrastructure is the back-bone of Apple Intelligence. It powers the largest Apple foundation models on servers and a wide gamut of services at Apple including Siri, Apple Music, AppleTV, AppStore, Photos & Camera, Spotlight, Safari, and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will work with one of the most exciting high performance computing environments, with petabytes of data, millions of queries per second, and have an opportunity to imagine and build products that delight our customers every single day. You will have a chance to work on optimizing billions of parameter language and vision and speech models using state of the art technologies and make it run at scale of Apple.

Requirements

Bachelor’s degree in Computer Science, relevant technical field, or equivalent practical experience
Strong background in computer science: algorithms, data structures and system design
15+ year experience on large scale distributed system design, operation and optimization with over 10 years of leading teams
Has managed work across a large organization, demonstrated the ability to develop strong leaders, with a consistent track record of executional excellence
Excellent collaboration skills, excelling at both high-level thinking & execution as well as in the ability to influence and inspire others to achieve a common goal

Nice To Haves

Master’s degree or PhD in Computer Science or related technical fields
Experience supporting distributed training inference workloads in production, ML systems performance profiling, debugging, and optimization
Proficiency in cloud-native architectures and orchestration platforms (e.g., Kubernetes)
Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models
Familiarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server etc
Hands-on experience working with ML accelerators such as GPUs and TPUs

Responsibilities

Provide leadership in building and evolving next-generation AI infrastructure for search and other product needs at Apple.
Shape the architecture and long-term technical strategy for large-scale inference systems that handle both internal workload and production traffic.
Integrate and evolve the web-scale search systems.
Work at the intersection of product innovation, AI research, and large scale distributed systems.
Design, build and maintain infrastructure to support features that empower billions of Apple users.
Take full end-to-end ownership of our services, driving them through every stage meticulously, encompassing conception, design, implementation, deployment, and maintenance.
Work on incredibly complex large scale systems with trillions of records and petabytes of data.
Work alongside teams to optimize inference for cutting edge model architectures.
Build production grade solutions for millions of customers in real time.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume