Sr. Engineering Manager, AI/ML Serving Platform

PinterestSan Francisco, CA
6dHybrid

About The Position

The AI/ML Serving Platform team provides foundational tools and infrastructure used by hundreds of AI/ML engineers across Pinterest, including recommendations, ads, visual search, growth/notifications, trust and safety. We aim to ensure that AI/ML systems are efficient, healthy (production-grade quality) and fast (for modelers to iterate upon). Pinterest is seeking a Sr. Engineering Manager to lead the team that builds the serving and deployment infrastructure for all AI/ML models at Pinterest. Systems include: Ultra-high-performance C++ model inference engine for production recommendations and content ranking systems. TorchScript + CUDA Graph models on GPU inference, serving 500+M inferences/second. Production GenAI & LLM model inference stack for emerging use cases. Model routing, deployment, monitoring. Kubernetes-based provisioning. Feature fetching, caching, and logging

Requirements

  • Experience managing platform engineering teams with many cross-organizational customers
  • Experience leading the development of large-scale distributed serving systems
  • Experience with AI/ML inference technologies (e.g. PyTorch, TensorFlow) for online serving at Web scale
  • Bachelor’s degree in Computer Science, a related field or equivalent experience.

Responsibilities

  • Lead the team to deliver continual improvements in advanced model architectures, cost-efficient resource utilization, and AI/ML developer productivity.
  • Set technical direction for the team based on company and org priorities
  • Coach and develop talent on the team.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service