Roblox-posted 13 days ago
Full-time • Mid Level
San Mateo, CA
1,001-5,000 employees

The Infrastructure Storage organization is seeking a Senior Software Engineer to join the Queue team. In this role, you will play a critical part in evolving and operating our in-house, large-scale distributed queue system based on Apache Kafka. Your work will directly support Roblox's ever-increasing need for high-performance queue business capabilities, as the system currently handles tens of trillions of events daily. You will: Lead the advancement and massive scaling of our 24x7 distributed queue service. This critical system is vital for handling a wide range of services, including analytics, matchmaking, trust-safety, economy, and database changefeed events. A key objective is to transition current Amazon SQS traffic to this in-house platform Play a leading role in the design, implementation, and operation of our internal Infra-as-a-Service (IaaS) offerings built on top of a container orchestration platform Design and build robust frameworks and tools to automate the development, testing, cluster management, and monitoring of our mission-critical services Drive improvements to the Service Level Agreements (SLAs) for our offerings and reduce the end-to-end rollout time for our suite of software solutions

  • Lead the advancement and massive scaling of our 24x7 distributed queue service.
  • Play a leading role in the design, implementation, and operation of our internal Infra-as-a-Service (IaaS) offerings built on top of a container orchestration platform
  • Design and build robust frameworks and tools to automate the development, testing, cluster management, and monitoring of our mission-critical services
  • Drive improvements to the Service Level Agreements (SLAs) for our offerings and reduce the end-to-end rollout time for our suite of software solutions
  • Proven experience in designing, delivering, and operating large-scale distributed systems that manage billions of real-time requests per second
  • Deep expertise in one or more messaging technologies, such as Apache Kafka or similar systems
  • A builder's mindset and experience running large-scale Active/Active distributed systems leveraging container orchestrators (e.g., Kubernetes or Nomad) and service discovery systems (e.g., Consul)
  • Proficiency in programming languages like Go, Java, or C++
  • A Bachelor's degree in Computer Science (or equivalent professional experience) with at least 3 years of hands-on working experience
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service