Software Engineer - Systems Engineering

Rubrik Job Board
Palo Alto, CA
121d
$152,400 - $228,700

About The Position

The Systems Engineering team within the Developer Platform organization is pivotal in ensuring the reliability and robustness of Rubrik's products. Our mission is to empower these products to excel in demanding use cases by rigorously focusing on key software reliability dimensions: stress, scale, longevity, and resilience. We collaborate closely with product component teams, product management, and release managers to integrate reliability early and ensure high-quality releases for our end customers. The team's key responsibilities include:

  • Drive release reliability certification for Rubrik Cloud Data Management and Rubrik Security Cloud-Private products
  • Architect and build scalable infrastructure and efficient pipelines for stress, scale, and resilience/chaos testing
  • Develop and deploy simulators to optimize cost efficiency and accelerate testing
  • Enable product teams with self-service tools and infrastructure for their validation needs
  • Maintain and evolve long-running, customer-like environments to proactively identify potential issues

Requirements

  • BS or MS in Computer Science or a related field, with a minimum of 2 years of relevant work experience
  • Strong knowledge of data structures, algorithms, and software design
  • Solid programming skills in one or more programming languages (Python preferred)
  • Ability to work collaboratively in a team environment and to get up to speed quickly with new technologies
  • Experience building AI-based applications and workflows using LLMs
  • Working knowledge of virtualization, container technologies, storage, databases, and networking
  • Experience with Google Cloud Platform/AWS/Azure or other public cloud technologies

Nice To Haves

  • Experience building high-scale, performant products
  • Knowledge of CI/CD and related automation and observability tools such as Jenkins, Ansible, and ELK

Responsibilities

  • Design and build infrastructure automation for on-demand creation of complex, customer-like product deployments and for system stress/performance pipelines
  • Develop and enhance tools for monitoring, alerting and telemetry of customer-like deployments
  • Develop AI-based solutions for automated triage and root cause analysis (RCA) of failures, and for self-healing of errors
  • Work with team members on detailed and constructive design and code reviews

Benefits

  • Bonus potential
  • Equity
  • Health benefits