Senior Site Reliability Engineer (SRE)

StubHub
Aliso Viejo, CA
Hybrid

About The Position

StubHub is on a mission to redefine the live event experience on a global scale. Whether someone is looking to attend their first event or their hundredth, we're here to delight them all the way from the moment they start looking for a ticket until they step through the gate. The same goes for our sellers. From fans selling a single ticket to the promoters of a worldwide stadium tour, we want StubHub to be the safest, most convenient way to offer a ticket to the millions of fans who browse our platform around the world.

StubHub is looking for a Senior Site Reliability Engineer (SRE) to design and develop next-generation technologies and complex features. As a Senior SRE at StubHub, you will be at the forefront of tackling significant, ambiguous, and non-trivial challenges as a core contributor and innovator, bringing creative technical solutions to life. To ensure our company's success, our engineers must demonstrate initiative and enthusiasm for the problems they tackle.

Location: Hybrid (3 days in office, 2 days remote) in New York, NY; Santa Monica, CA; or Aliso Viejo, CA

Requirements

  • Extensive experience (typically 5+ years) in site reliability engineering or a related role, demonstrating a strong command of incident management, mitigation, and prevention, as well as troubleshooting and performance tuning.
  • Experience developing robust, mission-critical systems using one or more general-purpose programming languages (e.g., C/C++, Java, C#, or any other OOP language).
  • Experience with cloud computing (AWS, GCP, Azure).
  • A strong track record of aggressively identifying and removing toil through process optimization, automation, and system design.
  • Demonstrated ability to write and maintain code for automation, infrastructure orchestration, and reliability tooling.
  • Demonstrated understanding of large-scale observability platforms and tools.
  • Understanding of orchestration systems such as Kubernetes.

Responsibilities

  • Build out and maintain an observability platform to ensure the reliability, availability, and performance of critical systems.
  • Collaborate with cross-functional teams to identify and address potential bottlenecks, optimize resource utilization, and proactively prevent system failures.
  • Drive the implementation of automation tools and Infrastructure as Code (IaC) practices to streamline deployment processes, configuration management, and infrastructure provisioning.
  • Help develop a center of excellence, fostering a culture that empowers teams to continuously and reliably deliver customer value.
  • Develop processes, tools, and automation to reduce toil across engineering teams.
  • Ensure systems effectively balance cost, performance, and reliability at scale.

Benefits

  • Accelerated Growth Environment: An environment designed for swift skill and knowledge enhancement, where you have the autonomy to lead experiments and tests on a massive scale.
  • Top Tier Compensation Package: Competitive base, equity, and upside that tracks with your impact.
  • Flexible Time Off: Enjoy unlimited Flex Time Off, giving you the flexibility to manage your schedule and take time to recharge as needed.
  • Comprehensive Benefits Package: Prioritize your well-being with a comprehensive benefits package, featuring a 401(k) and premium health, vision, and dental insurance options.