Sr. Engineering Manager, AI/ML Serving Platform

Pinterest
San Francisco, CA
Hybrid

About The Position

Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we’re on a mission to bring everyone the inspiration to create a life they love, and that starts with the people behind the product. Discover a career where you ignite innovation for millions, transform passion into growth opportunities, celebrate each other’s unique experiences and embrace the flexibility to do your best work. Creating a career you love? It’s Possible.

The AI/ML Serving Platform team provides foundational tools and infrastructure used by hundreds of AI/ML engineers across Pinterest, including recommendations, ads, visual search, growth/notifications, trust and safety. We aim to ensure that AI/ML systems are efficient, healthy (production-grade quality) and fast (for modelers to iterate upon).

Pinterest is seeking a Sr. Engineering Manager to lead the team that builds the serving and deployment infrastructure for all AI/ML models at Pinterest. Systems include:

  • Ultra-high-performance C++ model inference engine for production recommendations and content ranking systems
  • TorchScript + CUDA Graph models on GPU inference, serving 500+M inferences/second
  • Production GenAI & LLM model inference stack for emerging use cases
  • Model routing, deployment, monitoring
  • Kubernetes-based provisioning
  • Feature fetching, caching, and logging

Requirements

  • Experience managing platform engineering teams with many cross-organizational customers
  • Experience leading the development of large-scale distributed serving systems
  • Experience with AI/ML inference technologies (e.g. PyTorch, TensorFlow) for online serving at web scale
  • Bachelor’s degree in Computer Science or a related field, or equivalent experience

Responsibilities

  • Lead the team to deliver continual improvements in advanced model architectures, cost-efficient resource utilization, and AI/ML developer productivity.
  • Set technical direction for the team based on company and org priorities.
  • Coach and develop talent on the team.