Principal Software Engineer - Cluster Lifecyle

RobloxSan Mateo, CA
4hHybrid

About The Position

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone. As a Principal Software Engineer on the Compute Cell Lifecycle team you will create, support, and evolve the infrastructure at Roblox as we build out Roblox's private cloud. The Cell Lifecycle's mission is to create and manage a sustainable and reliable compute primitive across all backend environments (all on-prem and public cloud data centers) to all Roblox engineers. Come help us create, support, and evolve the infrastructure that manages the millions of containers that serve hundreds of millions of requests per second that power Roblox where you will have the opportunity to create long lasting impact on the entire company.

Requirements

  • 8+ years of experience
  • Experience working in the Kubernetes ecosystem. Prior experience building Kubernetes operators or building/running Kubernetes distributions preferred.
  • Strong proficiency in Go or other well structured programming languages.
  • Enjoy working on critical, large-scale, cross-platform, multi-tenant distributed systems.
  • Prefer building systems automation over operational and repetitive tasks.
  • An appreciation for working on observability and reliability to build long term sustainable systems

Responsibilities

  • Build and evolve a cell primitive for Roblox that runs the backends for the vast majority of Roblox’s compute workload.
  • Work closely with other teams in Compute and across the company to develop new features, support for new workloads, and define the right cross-system APIs as we expand the footprint of ‘cells’.
  • Safely and reliably manage a critical at-scale system.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service