About The Position

We are part of the Core AI Platform team at Microsoft, that builds the platforms, services, and operating mechanisms that power Microsoft’s rapidly growing AI model ecosystem. Our mission enables every developer to achieve more by using AI tools and our platform capabilities to infuse AI in their applications and services. We are looking for a Senior Technical Program Manager with technical depth, customer empathy, and an abundance of energy. The right candidate is highly effective, taking the initiative and thriving in building products that differentiate in the most competitive segment of the industry. This role is in the CoreAI Infrastructure team which powers the Microsoft AI Foundry Services. We manage the GPU fleet to run Microsoft’s AI services on a planet scale. We are in the eye of the storm to accelerate the transition and scaling of Generative AI models to work with the latest AI aware application. Our team focuses on fleet efficiency, reliability and the agility to bring the latest AI innovations from research into production. We obsess about customers, make data driven decisions, and have a strong focus on delivering impact. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Requirements

  • Bachelor's Degree AND 4+ years experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience.
  • 2+ years of experience managing cross-functional and/or cross-team projects.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice To Haves

  • 5+ years of experience designing and shipping complex products for developers, ML professionals, or similar audiences
  • 3+ years of experience with distributed platform ecosystems (e.g., multi‑tenant systems, real‑time processing, batch computing)
  • Proven experience in technical program management, preferably supporting AI infrastructure or cloud services
  • Strong understanding of GPUs, virtual machines, operating systems, and cloud infrastructure fundamentals
  • Experience navigating ambiguity and driving clarity across complex, cross‑functional initiatives
  • Experience building solutions using Azure, AWS, or Google Cloud
  • Experience writing Python, including for machine learning workloads
  • Experience with machine learning platforms
  • Experience driving complex, multi‑stakeholder processes and cross‑team programs
  • Ability to build effective relationships, influence, and collaborate at all organizational levels
  • Strong verbal and written communication skills for a global audience
  • Strong business acumen with an analytical, detail‑oriented approach
  • Experience leading cross‑functional teams and managing complex dependencies
  • Ability to thrive in fast‑paced environments with a bias for action

Responsibilities

  • Define product strategy, roadmap, and success metrics for platform capabilities that increase overall fleet efficiency, while delivering cost-effective, high-performance solutions for customers running AI workloads.
  • Define and track efficiency metrics, manage dependencies and lead experimentation to validate hypotheses and drive optimizations
  • Partner with engineering, data science, finance, and partner teams; manage dependencies and unblock execution in ambiguous environments
  • Build crisp narratives and dashboards that support decision-making and keep stakeholders aligned on progress, tradeoffs, and outcomes
  • Translate customer needs and platform constraints into clear requirements and iterative delivery plans
  • Identify high-impact opportunities to increase capacity efficiency.
  • Provide clarity in ambiguous environments, influence stakeholders, and foster a culture of innovation and accountability
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service