As a senior leader on our team, you will be responsible for the overall health, performance, and reliability of our infrastructure, driving initiatives that maximize compute capacity and directly support our critical AI objectives. This role is a blend of strategic leadership and hands-on technical ownership. You will leverage your deep Site Reliability Engineering (SRE) expertise to build robust systems, lead high-stakes technical escalations, and champion customer success. We're seeking a proactive problem-solver with extensive experience in large-scale distributed systems, a track record of leading high-performing teams, and a passion for tackling the most challenging technical problems.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Number of Employees
251-500 employees