IREN is a leading AI Cloud Service Provider, delivering large-scale GPU clusters for AI training and inference. IREN’s vertically integrated platform is underpinned by its expansive portfolio of grid-connected land and data centers in renewable-rich regions across the U.S. and Canada. We build, own and operate our data centers using 100% renewable energy, and we take pride in being at the forefront of sustainable solutions for the ever-evolving applications of high-performance compute. We believe that human progress is invaluable, but that it should be pursued in the right way: responsibly, sustainably and with a positive impact on the communities we operate in.
We are seeking a highly capable Incident Commander to sit at the center of the critical work supporting our HPC Data Center Operations. This role is responsible for leading the coordinated response to high-severity incidents, major outages, and critical service degradation events across HPC infrastructure and customer-facing production systems. The Incident Commander will serve as the operational command authority during major incidents, driving rapid detection, coordinated technical response, executive communication, service restoration, and post-incident operational improvement. The successful candidate must demonstrate operational leadership under pressure, the ability to coordinate cross-functional engineering organizations without direct authority, and the discipline to drive structured incident response during high-impact operational events.
Job Type: Full-time
Career Level: Senior