As a ML Engineering Manager for Foundation Model Runtime, you will manage and lead top flight ML engineers to deliver high performance inference solutions for generative AI features that power tentpole Apple features. You will work across organizations to connect product teams, model teams, FM services, cloud infrastructure, and your own team to deliver carefully engineered, high performance solutions aligned with critical milestones. You will also guide the team to architect for maximum leverage in the ML space, helping the team adapt the latest innovations in generative AI inference. Your role as Engineering Manager will provide support and leadership to incredible ML inference talent to deliver state of the art inference features and performance across Apple. This role offers the opportunity to turn cutting edge inference techniques into carefully crafted engineering solutions that will power both experimentation and production serving of generative AI models across multiple modalities. You will be gauging impact and aligning engineering resources to tackle new hardware and environments, meeting latency, throughput, and quality targets. You will develop technical talent and leadership in your team, and grow your team to meet new inference challenges. You will work cross functionally to set grounded expectations and provide key data that influence AI product development decisions and hardware investments.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Manager