Okta-posted 4 months ago
$168,000 - $227,000/Yr
Full-time • Senior
Bellevue, WA
5,001-10,000 employees

Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secure access, authentication, and automation, placing identity at the core of business security and growth. At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences. Join our team! We’re building a world where Identity belongs to you. This engineer will be in or near our Bellevue, WA office. This national security role necessitates a TOP SECRET/SCI security clearance and a favorable suitability review. It is a condition of employment that you obtain and continuously maintain this clearance for eligibility for access to classified information. Any inability to uphold these security standards may lead to termination. Our company is seeking a highly skilled Staff Site Reliability Engineer to join our team. We are a SaaS company specializing in securing large-scale systems. This role is a blend of software engineering and systems administration, where you'll be responsible for building and maintaining highly reliable, scalable, and secure infrastructure. You will be a key contributor, applying your expertise to automate manual processes and proactively solve complex problems before they become incidents, handling incidents, and includes on-call shifts.

  • Design, build, and maintain the core infrastructure that underpins our security SaaS offerings, ensuring high availability, performance, and scalability.
  • Develop robust automation using code to eliminate toil and ensure consistency across our environments.
  • Work closely with our security teams to embed a security-first mindset into all our processes and infrastructure.
  • Participate in on-call rotations and be a primary responder for critical incidents, leading root cause analysis and implementing preventative measures.
  • Partner with development, data science, and security teams to provide expert guidance on architectural decisions, best practices, and the implementation of new services.
  • Strong coding skills and comfortable writing production-level code to solve complex operational challenges.
  • Deep experience with Terraform for provisioning and managing cloud infrastructure and services.
  • Familiarity with modern CI/CD practices and tools, particularly Spinnaker.
  • Expertise in container technologies and hands-on experience managing large-scale, production-ready clusters with Kubernetes.
  • Experience with database schema management tools like Flyway.
  • Direct experience with large-scale data systems, specifically with the Snowflake platform.
  • Excellent analytical and problem-solving skills with a proactive approach to identifying and addressing potential issues.
  • Experience or a strong interest in AI/ML, particularly how these technologies can be applied to improve reliability, security, and operational efficiency.
  • Health, dental and vision insurance
  • 401(k)
  • Flexible spending account
  • Paid leave including PTO and parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service