Owns the operational health, stability, and reliability of enterprise infrastructure platforms, ensuring high availability, performance, and compliance across compute, storage, virtualization, cloud, and network environments. Operating within a run-focused model, this role is accountable for incident response, system performance, and continuous operational improvement. Serves as the senior technical escalation point for complex infrastructure issues, leading rapid resolution, root cause analysis, and long-term remediation planning. Partners closely with engineering and architecture teams to transition new capabilities into stable operations, ensuring solutions are supportable, observable, and aligned with enterprise standards. Drives automation, observability, and AI-enabled operational practices to improve efficiency, reduce manual intervention, and enhance overall service reliability.
Job Type: Full-time
Career Level: Senior