About The Position

Apple is where individual imaginations gather together, committing to the values that lead to great work! Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something - you'll add something! As part of the ML Compute team, you will be at the heart of designing, building, and maintaining the compute infrastructure that enables model training, tuning, and other ML tasks for machine learning, artificial intelligence, and computer vision applications. Make an impact by shaping and owning critical compute components, and be proud to contribute to the infrastructure that enables intelligent experiences across all Apple products and services, used by billions of people worldwide.

Requirements

  • Strong understanding of batch scheduling systems and high-performance computing environments.
  • Experience using system monitoring tools, automated testing frameworks and CI/CD pipelines.
  • Experience with GPUs and/or other ML accelerators in the context of Machine Learning.
  • A passion for making simple, robust, and scalable platforms used by other engineering teams.
  • Flexibility/adaptability for working in a dynamic environment with different frameworks and requirements.
  • Masters or PhD in Computer Science.

Responsibilities

  • Own the architecture, design, development, and operations of large-scale systems designed for machine learning.
  • Develop custom scheduling, resource management solutions, and fleet management for our ML model training compute infrastructure.
  • Collaborate with multi-functional teams, integrate with Kubernetes in on-premises and cloud provider clusters, and enable seamless integration with NVIDIA GPUs and other ML accelerators.
  • Partner with data scientists and machine learning engineers across different Apple organizations to define high-impact product features and deliver them with quality.
  • Lead a group of engineers to deliver high-quality products/services.
  • Support junior engineers by providing advice, mentoring, and educational opportunities.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Computer and Electronic Product Manufacturing

Education Level

Master's degree

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service