Senior Site Reliability Engineer

SentinelOne
73d$128,000 - $176,000

About The Position

We want to add a Senior SRE with extensive operations experience for a SaaS product, who can drive large-scale data infrastructure focusing on self-service and automation. You will also help keep our uptime promise to customers by ensuring we meet our SLOs. You will help our engineering teams ship software to our customers fast and with quality. In this job, you will have an amazing opportunity to drive outcomes that improve the reliability, stability, and cost efficiency of SentinelOne’s production services. You will join a like-minded team of awesome SRE engineers who help run our operations smoothly at scale.

Requirements

  • 3-5+ years of experience in running operations at a large scale for a SaaS product
  • 3-5+ years of production experience with orchestration systems like Kubernetes, Nomad, or Mesos
  • Python / Golang / Java / Ruby as main scripting languages (we use Python)
  • Familiarity with running Java and JavaScript applications including building and deploying
  • AWS experience and familiarity with other platforms like GCP and Azure
  • Experience using Terraform to set up cloud-native services
  • Familiarity with CI and practical delivery using Jenkins, GHA, ArgoCD, etc. or similar; familiarity with deployment strategies like blue-green, rolling deploys, canary deploys, and best practices around deployment automation
  • Keeping a pulse on the latest SRE trends
  • Ability to work in a diverse and distributed team is highly desired
  • Self-starter attitude with a passion and motivation for new technologies and empathy for legacy systems
  • Ability to learn quickly and navigate through unfamiliar programming languages, systems, and processes
  • Curiosity, desire to learn and improve, and great communication skills
  • Prior product-building experience is optional but strongly desired

Responsibilities

  • Drive continuous deployment
  • Command production incidents and drive the post-mortem process
  • Partner with product engineering teams to improve product quality and reliability
  • Simplify and automate operational tasks
  • Eliminate bottlenecks in SentinelOne infrastructure and services
  • Build tools to improve operations

Benefits

  • Medical, Vision, Dental
  • 401(k)
  • Commuter
  • Health and Dependent FSA
  • Unlimited PTO
  • Industry-leading gender-neutral parental leave
  • Paid company holidays
  • Paid sick time
  • Employee stock purchase program
  • Disability and life insurance
  • Employee assistance program
  • Gym membership reimbursement
  • Cell phone reimbursement
  • Numerous company-sponsored events including regular happy hours and team-building events
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service