Technical Program Management, Deployment

DeepMind•Mountain View, CA

About The Position

This role is on the Google DeepMind Deployment team. We drive model serving and deployment to deliver our models from research and training to production. The core focus of this role is the technical program management orchestrating the delivery of inference performance techniques, aligning research and production readiness timelines and working across cross functional workstreams ensuring that Gemini models are ready for production deployment. About Us Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority. The Role This is a high-impact opportunity for an exceptional Technical Program Manager to cover a critical area of Gemini’s launch operations - model deployment and serving, where you will be accelerating all parts of the model deployment process, across preprocessing, serving, evals and performance. You will work closely with a virtual team of contributors across MTV, NY, London, and Zurich, working with research teams to ensure we have clear timelines and roadmaps for our model deployment productionisation efforts. The role requires a high tolerance for ambiguity inherent in cutting-edge AI research and operates at a fast pace. The role requires a highly organized individual who excels at managing complex stakeholder relationships and connecting teams across Google DeepMind to drive successful outcomes.

Requirements

5+ years of leading large-scale complex programs, preferably across multiple geographies and time zones.
Possess excellent communication skills to articulate complex technical concepts clearly to diverse audiences, including executive leadership.
Demonstrated success in fast-paced program execution and deliveries.
Must be fluent in English and flexible to work cross-timezone.

Nice To Haves

Degree in Computer Science, Engineering or equivalent professional experience.
Strong understanding of ML/AI principles and the distinctions from traditional software development.
Understanding of LLM inference mechanics (KV Cache management, attention mechanisms, sampling strategies).

Responsibilities

Be the US timezone presence for Deployment and partner with a senior TPM in the EMEA timezone to ensure US and Europe coverage.
Drive the end-to-end delivery of the model serving and deployment program, working closely with engineers and researchers to architect and roadmap deployment plans from research to scaling out for production.
Dive deep into technical details, understanding the nuance of production inference and the trade offs between cost and performance optimisation.
Drive organizational efficiency by improving processes, streamlining execution, and implementing tooling improvements where applicable.
Proactively identify and drill down into blockers, moving quickly to unblock the research and engineering teams.
Coordinate across diverse groups within Gemini, including Research teams, Compute, and Go to Market and ensuring all stakeholders are aligned on plans and timelines.
Build consensus and influence outcomes across diverse teams and key stakeholders, ensuring strong alignment without relying on direct authority.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume