About The Position

CyberArk is building the cloud platform that secures machine identities at scale. We’re looking for a Staff Software Engineer to lead performance and reliability engineering across distributed, microservices-based systems. In this role, you will define how system behavior is measured, tested, and improved under real-world load — building the frameworks, tooling, and standards that enable teams to deliver scalable, production-ready services. You will operate at the intersection of quality, production engineering, and distributed systems, driving performance engineering as a core capability across the platform.

Requirements

  • 8+ years of experience in software engineering, SDET, performance engineering, or backend engineering roles
  • Strong coding skills in Go, Java, or Python, with experience building frameworks or system-level tooling
  • Proven experience designing and building performance testing systems, platforms, or frameworks
  • Deep understanding of performance testing methodologies (load, stress, soak, spike)
  • Deep understanding of workload modeling and traffic patterns
  • Deep understanding of latency (p50/p90/p99), throughput, and system behavior under load
  • Experience working with distributed systems and cloud-native architectures
  • Strong experience with observability platforms (Prometheus, Grafana, Datadog, etc.)
  • Experience integrating performance testing into CI/CD pipelines at scale
  • Solid understanding of Linux systems and networking fundamentals (TCP/IP, DNS, HTTP/S)
  • Experience driving technical direction or influencing engineering practices across multiple teams

Nice To Haves

  • Experience building or extending custom load testing frameworks (k6, Locust, Gatling, etc.)
  • Experience with containerized environments and orchestration (Kubernetes)
  • Familiarity with cloud infrastructure behavior under load (autoscaling, load balancing, storage systems)
  • Experience in SaaS or security-sensitive systems
  • Exposure to PKI, certificate lifecycle management, or identity systems
  • Experience with Infrastructure as Code (Terraform, Ansible)

Responsibilities

  • Define and drive performance and reliability engineering strategy across cloud-native, distributed systems
  • Design and build frameworks, tooling, and infrastructure for performance testing and system analysis at scale
  • Establish performance baselines, SLIs/SLOs, and measurable standards across services
  • Partner with engineering teams to embed performance testing into CI/CD workflows and development lifecycles
  • Analyze system behavior under load and identify bottlenecks across application, infrastructure, and data layers
  • Use observability data to interpret latency distributions, throughput, and failure patterns, and translate findings into actionable improvements
  • Build reusable performance tooling and test platforms adopted across multiple teams
  • Influence system design by advocating for scalability, resilience, and production readiness early in development
  • Drive adoption of performance and reliability practices across teams, acting as a technical leader and subject matter expert
  • Mentor engineers and contribute to raising the bar for system-level thinking and engineering quality

Benefits

  • bonus
  • medical
  • dental
  • vision
  • financial
  • other benefits

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

101-250 employees
