SRE Systems Engineer

SalesforceWashington DC, VA

About The Position

Join our Site Reliability Engineering (SRE) team, where you'll work alongside Infrastructure and Research & Development (R&D) partners to keep Salesforce cloud services available for customers around the clock. In this role, you'll detect and resolve incidents fast, drive automation, and help build the resilient systems that millions of customers depend on every day. This role is based in [insert location].

Requirements

  • Bachelor's degree in Computer Science, Information Systems, or a related technical field, or equivalent work experience.
  • Experience in enterprise-scale internet service engineering or support, with strong Command Line Interface (CLI) knowledge of Unix variants including Red Hat Enterprise Linux.
  • Expertise in Transmission Control Protocol/Internet Protocol (TCP/IP) networking technologies and protocols.
  • Demonstrated experience with incident management and a solid understanding of IT Infrastructure Library (ITIL) service operations in a 24/7 environment.
  • Proficiency writing scripts in Python, Go, or similar languages, with experience provisioning and operating Amazon Web Services (AWS) infrastructure.
  • This candidate must be a U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship and agrees to complete a U.S. federal government Minimum Background Investigation (MBI) for a Moderate Public Trust position.
  • This position requires a USA TS/SCI with Polygraph security access level.

Nice To Haves

  • Experience with configuration management tools such as Chef or Puppet, and pipeline tools such as Jenkins, Bamboo, or Spinnaker.
  • Hands-on experience supporting and managing Kubernetes-based applications and services.
  • Certifications in Linux+, Red Hat, or AWS.
  • Familiarity with Agile processes and DevOps practices.

Responsibilities

  • Monitor customer-facing services and respond to Severity 0 (Sev0) and Severity 1 (Sev1) incidents, leading technical reviews and contributing to Root Cause Analyses (RCAs) handed off to the Global Solutions team.
  • Automate the detection and resolution of recurring production issues to reduce engineering and operations toil.
  • Contribute to compliance, resiliency, and self-healing initiatives including destructive testing and game day exercises.
  • Partner with and mentor team members to stay current on industry technology and drive team development.

Benefits

  • time off programs
  • medical
  • dental
  • vision
  • mental health support
  • paid parental leave
  • life and disability insurance
  • 401(k)
  • employee stock purchasing program
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service