Site Reliability Engineer

$119,000 - $161,000/Yr

LiveRamp - San Francisco, CA

posted 2 months ago

Full-time - Mid Level
San Francisco, CA
Professional, Scientific, and Technical Services

About the position

The Site Reliability Engineer (SRE) at LiveRamp plays a crucial role in supporting and maintaining the deployment of global products, ensuring their availability and operational efficiency. This position involves providing 24/7 engineering support, driving resolutions for core product issues, and enhancing infrastructure reliability through monitoring and alerting. The SRE will collaborate closely with various engineering teams across the globe, contributing to the development of operational documentation and best practices in site reliability engineering.

Responsibilities

  • Support and/or own the deployment of global products including setting up production and internal environments
  • Provide 24/7 first line of Engineering support for issues related to global product deployment and internal operations support
  • Drive effective resolutions of core product issues with Engineering teams
  • Setup and maintain Infrastructure & Product Reliability monitoring and alerting
  • Maintain and enhance CI/CD Tooling and Terraform scripts in collaboration with the DevOps team
  • Maintain and enhance Engineering Operational Documentation for supported products
  • Provide expertise to build and maintain products operational documentation and set up product SRE practices
  • Support Security and Compliance governance in production environments
  • Work in close collaboration with SRE team members and Engineering organizations across various global locations.

Requirements

  • 3+ years of experience in SRE, DevOps, or production engineering
  • Experience in Infrastructure as code (IaC) using Terraform
  • Experience in building continuous integration declarative pipelines in Jenkins or CircleCI
  • Experience with platforms like Kubernetes, Containers, and public clouds (Google Cloud Platform or AWS)
  • Experience with deployment and monitoring of highly scalable products
  • Experience in Python or Go programming language
  • Experience with SRE best practices and observability principles
  • Ability to diagnose technical problems, debug code, and automate routine tasks
  • Experience with securing systems in a public cloud environment
  • Ability to engage other engineers as stakeholders
  • Enjoy working as part of a distributed team.

Nice-to-haves

  • Working knowledge of observability principles is a big plus

Benefits

  • Work with talented, collaborative, and friendly people
  • Enjoy catered meals, boundless snacks, and food trucks
  • Participate in events such as game nights, happy hours, camping trips, and sports leagues
  • Every employee is a stakeholder in the company's future
  • Comprehensive health, dental, vision, and disability insurance
  • 401k matching plan
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service