In this role, you will set the direction for SRE strategy and execution across a broad portfolio, driving operational maturity, resilience engineering, and automation, while partnering with senior technology and business leaders to deliver world-class availability, performance, and customer experience for the products our customers depend on.

In this opportunity as Senior Director, Site Reliability Engineering, you will:
- Own 24x7 reliability outcomes for a portfolio of applications and services, ensuring customer-impacting issues are prevented, detected, and resolved quickly.
- Serve as the executive escalation leader during major incidents, providing calm, clear, and timely communication to senior leadership and key stakeholders.
- Partner with product engineering and platform teams to embed reliability into architecture, development, and release practices, ensuring operational readiness for new features and products.
- Define and implement best-in-class practices across observability, incident management, on-call operations, capacity planning, and disaster recovery.
- Drive automation and AI-assisted operations to improve efficiency and reduce mean time to mitigation/resolution (MTTM/MTTR).
- Lead, coach, and scale a global organization of SRE managers and engineers, building a culture of accountability, learning, and continuous improvement.
- Influence roadmaps and investment decisions to prioritize reliability, resiliency, and performance, while ensuring alignment with internal controls, external standards, certifications, and security requirements.
Job Type
Full-time
Career Level
Director
Education Level
No Education Listed
Number of Employees
5,001-10,000 employees