Azure is Microsoft’s central cloud infrastructure hosting both public cloud offerings and a wide range of Microsoft internal cloud-scale services. Cloud computing is a highly competitive and rapidly growing market, and Azure aims to be an industry leader across all relevant aspects of its platform and services. Within Azure, the Azure Compute team serves as the core infrastructure organization responsible for hosting virtual machines (VMs), containers, and other workloads. One of the foundational disciplines in cloud computing is capacity management, which must ensure sufficient capacity across all regions, allocation domains, and hardware infrastructure to meet customer demand while also ensuring efficient provisioning to avoid overspending and impacts to cost of goods sold (COGS) and capital expenditures (CAPEX). At the scale of Azure Compute, managing the balance between availability and efficiency across the entire fleet is a highly complex challenge, where improvements can prevent customer allocation failures and deliver significant savings. The Azure Compute Capacity and Efficiency (AC2E) team is responsible for managing all aspects of capacity and efficiency across the fleet. The team’s primary responsibility is to deliver a fully automated and highly optimized tracking and management system. This system, which includes the Capacity Management Automation System (CMAS) as a core component, uses advanced algorithms and artificial intelligence (AI) to predict capacity risk and execute appropriate mitigation actions directly within the Azure Compute platform. As a member of this team, individuals work closely with engineers, program managers, and data scientists across Azure Compute platform teams, as well as partners in capacity planning. Responsibilities include formulating business problems and driving solutions end to end, from design through production, while contributing to strategic decision-making for features that impact capacity and efficiency. The impact of this work is reflected in improvements to the Azure platform, service capacity fulfillment rates, customer satisfaction, and efficiency metrics including reductions in cost of goods sold (COGS). The team is composed of data science and development engineers and collaborates closely with program management partners, applying techniques such as anomaly detection, machine learning, and experimentation methodologies to solve complex real-world problems and deliver innovation at scale. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level