The AI/ML Serving Platform team provides foundational tools and infrastructure used by hundreds of AI/ML engineers across Pinterest, including recommendations, ads, visual search, growth/notifications, trust and safety. We aim to ensure that AI/ML systems are efficient, healthy (production-grade quality) and fast (for modelers to iterate upon). Pinterest is seeking a Sr. Engineering Manager to lead the team that builds the serving and deployment infrastructure for all AI/ML models at Pinterest. Systems include: Ultra-high-performance C++ model inference engine for production recommendations and content ranking systems. TorchScript + CUDA Graph models on GPU inference, serving 500+M inferences/second. Production GenAI & LLM model inference stack for emerging use cases. Model routing, deployment, monitoring. Kubernetes-based provisioning. Feature fetching, caching, and logging
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Manager