About The Position

The Forge Team: Engineering the Backbone of Rubrik's Platform The Forge team is at the core of Rubrik's mission to secure the world's data. As the platform and systems engineering team, founded by one of Rubrik's co-founders and CTO, our mission is to build a highly reliable, secure, and scalable software-defined platform. We are the architects and builders of the fundamental infrastructure, with a deep focus on the Operating System, Networking, Storage, Clustering, and Containerization. Our work involves tackling some of the most complex networking challenges for the world's largest enterprises. We design and own the container-based infrastructure that underpins Rubrik's cyber resilience products. We are responsible for providing seamless, zero-interruption horizontal scaling for both our on-prem and cloud customers, ensuring our platform can handle massive data growth without a hitch. We also act as the guardians of our clusters' health and performance. We build frameworks and services that proactively monitor our services and low-level system resources, ensuring our clusters are always available and performing at their maximum capacity. By owning the entire OS and kernel layer, we have the deep control and expertise necessary to optimize the foundational software that powers our platform. At our core, we are the trusted experts who can solve the most intricate operating system, Linux, and system-performance problems, pushing the boundaries of what's possible in enterprise infrastructure. We are looking for systems engineers who thrive on solving these complex, low-level challenges and are dedicated to building robust, long-lasting solutions that serve as the backbone for powering Rubrik's industry-leading data protection, security, and data management solutions. If you are passionate about building the foundation for a more resilient digital world, we encourage you to reach out. As a member of the Forge team, you'll be at the forefront of building the robust, software-defined platform that powers Rubrik's cyber resilience. Your work will span the full software development lifecycle, allowing you to have a significant and direct impact on the business.

Requirements

  • BS/MS/PhD in Computer Science or related field
  • 0-2 years of software development experience on Linux, preferably in Platform/Systems/Kernel or Networking domain
  • Strong fundamentals in data structures, algorithms, and distributed systems design
  • Strong background in Systems Programming
  • Expertise in debugging and troubleshooting performance and system-level issues
  • Proficient in Python, Go, and either C++, Java, or Scala

Nice To Haves

  • Experience with Linux kernel or equivalent and kernel debugging is a plus
  • Large distributed systems design and development experience is preferred
  • Knowledge of Storage, Filesystems, or Data Protection technologies is a plus

Responsibilities

  • End-to-End Ownership & Development
  • You'll take complete ownership of projects, from initial design to final deployment, architecting and delivering the next generation of our core infrastructure. This involves building services that are scalable, reliable, and high-performing.
  • Actively own team features and systems, defining their long-term vision and ensuring their health.
  • Collaborate with cross-functional teams to build features with multi-component dependencies.
  • Design and develop high-performance, scalable, and resilient system-level services.
  • Author detailed design documents, test plans, and automated tests to ensure high-quality results.
  • Solving Complex Customer Problems
  • You'll be a trusted expert and a champion for our customers, diagnosing and resolving the most intricate system and performance issues in their complex environments.
  • Collaborate closely with engineering and support teams to debug and resolve complex customer issues.
  • Proactively identify and bridge gaps in our product, processes, and tooling.
  • Operationalize monitors and tools to aid and scale the debugging process.
  • Assess the reliability of the Linux system stack and create new features to enhance error handling and performance.
  • Fostering Team Excellence
  • We believe in a high bar for quality and continuous improvement. You'll work alongside other talented engineers in a collaborative environment where you're encouraged to share your expertise and grow your skills.
  • Mentor new and junior engineers, helping them succeed in our fast-paced environment.
  • Contribute to the evolution of our engineering standards, tooling, and processes.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service