In this role, you will be a key pillar of our engineering organization, ensuring that our services remain highly available and performant. Your impact will include: System Architecture: Designing and implementing the next generation of our telemetry and alerting systems. Reliability Engineering: Defining SLOs/SLIs and ensuring our monitoring strategy captures the true health of the user experience. Operational Excellence: Reducing operational load through software; if you have to do it twice, you’ll want to automate it. Collaboration: Partnering with App Dev teams to influence the "design for reliability" phase of the software development lifecycle. Mentorship: Acting as a technical lead for junior members and off-shore partners, providing guidance on runbook development and disaster recovery.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed