M365 Copilot Inference is a high-impact engineering team advancing applied AI and large-scale machine learning across Microsoft. The team designs and operates the platform powering Microsoft 365 Copilot experiences, running at massive GPU (Graphics Processing Unit) scale across multiple regions and SKUs in global datacenters. It builds core LLM (large language model) API (Application Programming Interface), routing, and capacity control plane services to deliver low-latency, highly available Copilot experiences. We’re hiring a Principal Software Engineering Manager to lead a team focused on control plane automations for capacity buildout. This is a hands-on technical leadership role centered on how Copilot capacity is requested, planned, deployed, and operated. The manager will contribute to capacity planning and custom model deployment automation, partnering closely with peer managers and adjacent areas to shape how the broader control plane evolves. The space spans intake, planning, deployment, fleet health, and unified control plane surfaces. This role is based out of Redmond, WA and employees are expected to work from a designated Microsoft office at least three days a week. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Principal