Staff Site Reliability Engineer - Platform

IonQCollege Park, MD
22hHybrid

About The Position

IonQ is developing the world's most powerful full-stack quantum computer based on trapped-ion technology. We are pushing past the limits of classical physics and current supercomputing technology to unlock a new era of computing. Quantum computing has the potential to impact every area of human society for the better. IonQ’s computers will soon redefine industries like medicine, materials science, finance, artificial intelligence, machine learning, cryptography, and more. IonQ is at the forefront of this technological revolution. Want to help us build the world's largest and most precise quantum computing platform? We trap atoms in a vacuum chamber and manipulate them with lasers to perform calculations that we intend to scale beyond the reach of today's supercomputers. Unfortunately, the lasers don’t prevent bugs. We’re looking for a Site Reliability Engineer to help increase performance, decrease latency, and ensure that the world’s best quantum computers have the best possible uptime.

Requirements

  • BS degree in Computer Science, Computer Engineering, or equivalent practical experience
  • 8+ years of professional experience or an equivalent combination of education and experience
  • 5+ years experience in site reliability engineering
  • 3+ years experience with Kubernetes
  • Experience with learning from incidents
  • Experience with virtualized and containerized environments
  • Experience operating and debugging Unix/Linux OS internals (e.g., filesystems, inodes, system calls) and/or networking (e.g., TCP/IP, routing, network topologies and hardware, SDN)
  • Strongly capable in a scripting language of your choice (Shell, Python, etc.)
  • Able to identify processes in need of automation quickly, and automate them
  • Able (and excited) to mentor junior engineers
  • Excellent writer, capable of driving best practices throughout the org

Nice To Haves

  • 10+ years of experience in software development
  • 5+ years of experience with VMware and Terraform
  • Comfort with Google Cloud
  • Experience with scaling databases and applications
  • Experience with deploying bare-metal Kubernetes
  • Experience with incident management and leading incident resolution
  • Experience with incident research and analysis of contributing factors

Responsibilities

  • Be the first site reliability engineer at IonQ dedicated to the cloud team!
  • You’ll create, support, and manage infrastructure, instrumentation, and tooling for both our product and the engineering teams.
  • You’ll be key in providing reliable services to our customers, and a force multiplier for our engineers by helping us eliminate toil and scale our systems sustainably.
  • Maintain monitoring and alerting systems deployed on Kubernetes (both self-managed on-prem and in the cloud) and on Linux workstations.

Benefits

  • comprehensive medical, dental, and vision plans
  • matching 401K
  • unlimited PTO and paid holidays
  • parental/adoption leave
  • legal insurance
  • a home internet stipend
  • pet insurance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service