SentinelOne-posted 3 months ago
$148,000 - $204,000/Yr
Senior
1,001-5,000 employees

Join SentinelOne as a Staff Infrastructure Engineer and play a crucial role in building infrastructure. SentinelOne’s XDR vision of one autonomous cybersecurity platform depends on our ability to collect, store, and analyze data efficiently. If you are interested in building infrastructure for data platforms that run reliably with 99.99% uptime, can scale to petabytes of data ingested every day (and >2 trillion events processed), and return queries with p95 latencies less than 5 seconds, you will love this opportunity.

  • Lead the design and operation of distributed data services—including Kafka and Redis—running at massive scale across Kubernetes clusters and multi-cloud environments.
  • Unlock complete cloud portability for SentinelOne’s services by building a highly automated, self-service infrastructure that can run seamlessly across AWS, GCP, and air-gapped on-prem environments.
  • Manage data infrastructure supporting 5+ PB/day ingestion, ensuring low-latency, high-throughput, and cost-effective operation at global scale.
  • Consolidate and optimize multi-tenant Kafka clusters to reduce cost, improve resilience, and streamline operations.
  • Drive Redis and Kafka lifecycle automation using GitOps principles (ArgoCD, Terraform), reducing operational toil and minimizing pager fatigue.
  • Define and implement standards for observability, HA, backup, and DR of stateful workloads in Kubernetes.
  • Partner with FinOps and engineering stakeholders to continuously optimize performance, cost, and operational overhead across data platform components.
  • Own the end-to-end platform experience for mission-critical open-source systems such as Kafka, Redis, and Cassandra, serving hundreds of product teams.
  • 8+ years of experience in infrastructure/platform engineering, with a proven track record of operating stateful distributed systems at scale.
  • Deep hands-on experience with Kafka and Redis running in Kubernetes, including performance tuning, scaling, partitioning, persistence, and operator-based lifecycle management.
  • Strong understanding of Kubernetes internals and best practices for managing both stateless and stateful workloads in production environments.
  • Experience providing Database- or Messaging-as-a-Service (DBaaS/PaaS) for internal development teams or external customers.
  • Exposure to multi-cloud environments with strong expertise in at least one major provider: AWS, GCP, or Azure.
  • Experience with Infrastructure as Code and GitOps practices (Terraform, ArgoCD, Pulumi).
  • Familiarity with advanced deployment strategies (blue-green, canary, rolling).
  • Strong scripting or development skills (e.g., Python, Go, or similar).
  • Solid understanding of CI/CD pipelines and workflow automation (GitHub Actions, Argo Workflows, etc.).
  • Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA
  • Unlimited PTO
  • Industry-leading gender-neutral parental leave
  • Paid Company Holidays
  • Paid Sick Time
  • Employee stock purchase program
  • Disability and life insurance
  • Employee assistance program
  • Gym membership reimbursement
  • Cell phone reimbursement
  • Numerous company-sponsored events, including regular happy hours and team-building events
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service