Senior Site Reliability Engineer (SRE)

Stubhub•Aliso Viejo, CA

32d•Hybrid

About The Position

StubHub is on a mission to redefine the live event experience on a global scale. Whether someone is looking to attend their first event or their hundredth, we're here to delight them all the way from the moment they start looking for a ticket until they step through the gate. The same goes for our sellers. From fans selling a single ticket to the promoters of a worldwide stadium tour, we want StubHub to be the safest, most convenient way to offer a ticket to the millions of fans who browse our platform around the world. StubHub is looking for Senior Site Reliability Engineer (SRE) to design and develop next-generation technologies and complex features. As a Senior SRE at StubHub, you will be at the forefront of tackling significant, ambiguous, and non-trivial challenges as a core contributor and innovator, bringing creative technical solutions to life. In order to ensure our company's success, our engineers must demonstrate initiative and enthusiasm for the problems they tackle. Location: Hybrid (3 days in office/2 days remote) - New York, NY or Santa Monica, CA or Aliso Viejo, CA

Requirements

Extensive experience (typically 5+ years) in a site reliability engineering or a related role, demonstrating a strong command of incident management, mitigation, & prevention, troubleshooting, and performance tuning.
Experience with developing robust, mission-critical systems using one or multiple general-purpose programming languages (e.g., C/C++, Java, C# or any other OOP language)
Experience with cloud computing (AWS, GCP, Azure)
A strong track record of aggressively identifying and removing toil through process optimization, automation and system design
Demonstrated ability to write and maintain code for automation, infrastructure orchestration, and reliability tooling.
Demonstrated understanding of large scale observability platforms and tools
Understanding of orchestration system such as Kubernetes

Responsibilities

Build out and maintain an observability platform to ensure the reliability, availability, and performance of critical systems.
Collaborate with cross-functional teams to identify and address potential bottlenecks, optimize resource utilization, and proactively prevent system failures.
Drive the implementation of automation tools and Infrastructure as Code (IaC) practices to streamline deployment processes, configuration management, and infrastructure provisioning.
Help develop a center of excellence, fostering a culture of empowering teams to continuously and reliably deliver customer value
Develop processes, tools and automation to reduce toil across engineering teams
Ensure Systems effectively balance cost, perfomance and reliability at scale

Benefits

Accelerated Growth Environment : An environment designed for swift skill and knowledge enhancement, where you have the autonomy to lead experiments and tests on a massive scale.
Top Tier Compensation Package : Competitive base, equity, and upside that tracks with your impact.
Flexible Time Off : Enjoy unlimited Flex Time Off, giving you the flexibility to manage your schedule and take time to recharge as needed.
Comprehensive Benefits Package : Prioritize your well-being with a comprehensive benefits package, featuring 401k, and premium Health, Vision, and Dental Insurance options.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume