Are you a Principal Site Reliability Engineering Manager interested in improving the reliability of large-scale engineering systems serving multiple major Microsoft divisions? Are you seeking an opportunity to transform how we deliver engineering services, built on a foundation of SRE principles and practices like automated monitoring and alerting, automatic failover, and broad-scale service best practices? Are you motivated by coaching and people leadership, helping a diverse team of SREs unlock their full potential? If so, we have an opportunity for you.
The ES365 org is responsible for the engineering systems, tools, and services that comprise the end-to-end developer experience for the teams that build Office, Exchange, and Microsoft 365, and for those who work in our large-scale web frontend monorepo. Our areas of ownership cover source control, check-in processes, build, validation, and deployment automation. Reliability and operational proficiency are critical to keeping engineering teams productive, and our business needs have shifted from local on-prem operations experience to building and operating reliable cloud services at scale.
The Principal Site Reliability Engineering Manager will work effectively with a range of stakeholders, from executives to engineers, balancing near-term reliability improvements with long-term resilience strategies. You will drive cross-org partnerships, establish service level objectives (SLOs) and service level indicators (SLIs), and lead incident response and continuous improvement through Engineering Service Reviews, SRE service co-ownership campaigns, and updated service best practices.
We believe that significant achievements happen within high-functioning, trust-filled teams. A reliable manager ensures success in execution, promotes career growth, and cultivates a culture centered on customer focus, collaboration, diversity, and inclusion.
If you are committed to improving engineers' productivity and satisfaction through reliable, scalable tool and service operations, consider joining ES365. Be at the core of Microsoft and help shape the future of Engineering Systems by raising the bar on availability, performance, and operational success.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Job Type: Full-time
Career Level: Principal