Pinterest is seeking a Sr. Engineering Manager to lead the team that builds the serving and deployment infrastructure for all ML models at Pinterest. The ML Platform team provides foundational tools and infrastructure used by hundreds of ML engineers across Pinterest, in areas including recommendations, ads, visual search, growth/notifications, and trust and safety. We aim to ensure that ML systems are efficient, healthy (production-grade quality), and fast (for modelers to iterate on). Our systems include an ultra-high-performance C++ model inference engine for production recommendations and content ranking systems; TorchScript + CUDA Graph models for GPU inference, serving 500M+ inferences per second; a production GenAI and LLM inference stack for emerging use cases; and infrastructure for model routing, deployment, monitoring, feature fetching, caching, and logging.