About The Position

Foundation Model Services team, within Machine Learning Platform Technologies organization is the back-bone of Apple Intelligence. It builds frameworks, services and tools that power the largest Apple foundation models on servers. Our Infrastructure powers a wide gamut of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will get a chance to bring Intelligence to billions of users across the world. You will have an opportunity to make a difference in life of people. You will have a chance to work on optimizing billions of parameter language and vision and speech models using state of the art technologies and make it run at scale of Apple.

Requirements

  • Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow.
  • Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.
  • Familiarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server.

Responsibilities

  • Work closely with product teams to build production grade solutions to launch models serving millions of customers in real time.
  • Work alongside Foundation Model Research team to prototype and develop inference for cutting edge model architectures.
  • Build tools to understand bottlenecks in Inference for different hardwares and use cases.
  • Mentor and guide engineers in the organization.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Computer and Electronic Product Manufacturing

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service