We are seeking an experienced Major Incident Lead – Site Reliability to join our Managed Services team. This role is responsible for leading the response to high-severity, customer-impacting incidents across InterSystems’ managed services platforms. Acting as the Incident Commander, the role ensures rapid service restoration, clear and confident stakeholder communication, and disciplined coordination across SRE, engineering, support, cloud, and service delivery teams. Operating within an SRE-aligned service model, the Major Incident Lead focuses on protecting service reliability through the effective use of service level indicators and service level objectives, prioritizing customer impact reduction over root cause analysis during live incidents. Beyond incident response, the role drives post-incident reviews, turning operational failures into measurable reliability improvements and reduced repeat incidents. This position is critical to maintaining customer trust, platform resilience, and operational excellence in a 24x7, mission-critical, and highly regulated environment.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level