Empower AI innovation by accelerating the delivery of cloud-based accelerator (GPU) NPIs built into large-scale supercomputer clusters, including next-generation cross-functional development, customer and vendor partnerships, and ML workload monitoring and diagnostic tooling.

As a GPU Technical Program Manager for Google Cloud’s AI and Computing Infrastructure team, you will be at the forefront of AI innovation, leading the end-to-end development and delivery of next-generation Cloud GPU products from initial concept to full-scale production. You will own software qualification and release strategies for AI hypercompute clusters, collaborating closely with engineering, product, and capacity planning teams to align customer and business priorities. Beyond managing critical escalations and mitigating risks, you will shape cross-functional initiatives alongside Application Centric Infrastructure (ACI) leadership and Technical Program Managers (TPMs) across the broader organization to streamline customer onboarding and scaled support for our largest, most complex Cloud ML solutions.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers, and the billions of people who use Google services around the world. We prioritize security, efficiency, and reliability across everything we do, from developing our latest TPUs to running a global network, while working to shape the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.
Job Type
Full-time
Career Level
Senior