Senior Site Reliability Engineer, Atlas

MongoDBBoston, MA
87d$118,000 - $231,000

About The Position

MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build and run applications anywhere—on premises, or across cloud providers. With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it’s no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications. We are looking for an experienced Senior Engineer for our SRE, Atlas team to support, maintain and grow the Atlas platform. As a senior SRE, you will be expected to be able to design & build complex systems, operate with autonomy and act as owner for everything you do. The SRE Atlas team works alongside the various Atlas software engineering teams to provide expertise about running systems at scale, build new tooling and automation and perform essential maintenance of the Atlas fleet. This is an SRE team, which means you can expect a highly hands-on approach, tackling the technical challenges of implementing large scale solutions that have the ability to impact our customer’s most crucial workloads.

Requirements

  • 5+ years of experience running critical systems at scale
  • Value efficiency in processes and operations, and display a preference for automation over manual processes
  • Familiarity with a major cloud provider (AWS, Azure, or GCP) and the ability to build and operate systems in a multi-cloud environment
  • Strong understanding of how to run a large scale Linux environment, including low level fundamentals
  • Firm grasp of at least one modern programming language, beyond basic scripting (Go, Ruby, Python)
  • Solid understanding of web and network protocols and standards (HTTP, TLS, DNS, etc)

Responsibilities

  • Participate in the development of a reliable and resilient multi-cloud platform that hosts business critical applications for a wide & varied range of customer applications
  • Collaborate with service-owning teams to provide internal support, solve technical challenges and adapt or build tooling to solve novel use cases in a generic fashion
  • Participate in a 24/7 on-call rotation to swiftly resolve issues related to any disruption of our customer facing Atlas fleet, ensuring minimal disruption and high availability

Benefits

  • Equity
  • Participation in the employee stock purchase program
  • Flexible paid time off
  • 20 weeks fully-paid gender-neutral parental leave
  • Fertility and adoption assistance
  • 401(k) plan
  • Mental health counseling
  • Access to transgender-inclusive health insurance coverage
  • Health benefits offerings
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service