About The Position

Who We Are: Roblox is reimagining the way people come together to connect, create, and express themselves. To support our massive scale, we are powered by thousands of microservices. The Application Networking team connects and secures these services, building ingress gateways, managing the service mesh, and enabling seamless communication across Roblox’s hybrid on-prem and cloud infrastructure. We are executing on Cloud Bursting, a strategic initiative to enable Roblox’s core services to transparently burst from on-prem data centers to the cloud, allowing us to handle historical peak concurrent players and survive regional failures. You will join our Service Mesh team to build the networking fabric that makes this possible.

Requirements

  • A professional with 3+ years of experience in distributed systems, with strong expertise in service mesh technologies (Istio, Envoy, Consul, or Linkerd).
  • Well-versed in the tradeoffs of sidecar-based architectures at scale.
  • Deeply knowledgeable about service mesh concepts: traffic management, service discovery, mTLS, observability, and routing policies.
  • Experienced running Kubernetes in production and familiar with the K8s networking model.
  • Comfortable working with Envoy proxy internals, xDS APIs, and control plane architectures.
  • Excellent at designing large-scale distributed systems that span multiple clusters, regions, and runtime environments.
  • Fluent in Go, C/C++, or Rust.
  • Experienced with on-call rotations, approaching infrastructure with a "reliability-first" mindset.

Responsibilities

  • Design and build the service mesh infrastructure that enables communication across Roblox’s hybrid Kubernetes and Nomad environments, supporting billions of daily requests.
  • Drive the integration of service mesh with Kubernetes, ensuring reliable sidecar injection, mTLS, traffic policies, and observability for production workloads.
  • Build the networking foundation for Cloud Bursting, enabling Roblox’s top services to transparently burst from on-prem data centers to the cloud, with service discovery, traffic management, and locality-aware routing across environments.
  • Partner with internal application teams to understand their connectivity pain points, ensuring the service mesh provides a frictionless "paved road" for thousands of developers.
  • Collaborate with the Gateway and CNI teams to deliver a unified, multi-cluster service fabric that abstracts cluster and regional boundaries from application developers.
  • Serve as a primary escalation point for complex service mesh issues, troubleshooting across Envoy sidecars, Istio control plane, and the underlying network stack.
  • Act as a senior voice on the team, mentoring junior engineers and promoting best practices in testing, deployment, and reliability engineering.