Systems Engineering Associate - GovCloud [Salesforce National Security]

Salesforce
Herndon, VA
$111,000 - $122,000
Onsite

About The Position

Salesforce is seeking an engineering candidate to join the Site Reliability organization. Working closely with counterparts in the Infrastructure and R&D organizations, this organization provides a team of engineers monitoring cloud service availability and ready to swiftly repair any service-impacting issues. Seven days a week, 24 hours a day, the Site Reliability team keeps the Salesforce cloud and our customers protected. As a member of the Site Reliability team, you will be responsible for the primary task of detecting and resolving incidents within minutes. This objective is met by supervising the services, reacting to problems, and proactively addressing issues before they affect performance or availability. The team contributes to the customer and Salesforce by securing data through monitoring, automation, self-healing and resiliency initiatives, destructive testing, and game day exercises. The incumbent in this role would demonstrate a solid focus on tactical operations, as well as large-scale production engineering and orchestration.

Requirements

  • U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship. You must hold an active TS/SCI clearance with polygraph from the U.S. federal government, or other clearances as deemed appropriate for the role
  • A related technical degree required
  • Systems engineering experience in an enterprise-scale internet service engineering or support role
  • Expertise in TCP/IP-related technologies (networking protocols, network programming, etc.)
  • Expertise in command-line enterprise support of Unix variants (Linux/Solaris/BSD), with strong Linux/UNIX knowledge and significant exposure to Red Hat Enterprise Linux and Solaris
  • Solid understanding of monitoring security systems and administration
  • Good interpersonal skills (written and oral)
  • Past experience in Incident Management and a good understanding of ITIL service operations
  • Experience working on a 24/7 team managing large data centers
  • Availability for shift work if required (1400 - 2200, 4 days a week)
  • Experience provisioning, operating, and running AWS/C2S based infrastructure and systems
  • Experience writing scripts in Python, Go, or other languages
  • This candidate must be a U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship and has an active TS/SCI with polygraph.
  • This candidate must be a U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship and agrees to complete a U.S. federal government Minimum Background Investigation (MBI) for a Moderate Public Trust position.
  • This position requires a USA TS/SCI with Polygraph security access level.

Nice To Haves

  • Prior Chef/Puppet or automated deployment experience
  • Prior Jenkins/Bamboo/Spinnaker pipeline execution experience
  • Experience in supporting and maintaining monitoring and alerting systems
  • Experience in supporting and maintaining Java applications
  • Hands-on experience configuring and running AWS (Amazon Web Services), using the CLI/SDKs
  • Experience managing systems monitoring and alerts.
  • Certifications such as Linux+, Red Hat, and AWS
  • Experience in supporting and leading Kubernetes-based applications and services
  • Familiarity with Agile processes and DevOps

Responsibilities

  • Keep the customer-facing services available at top performance by maintaining the constant health of the supporting systems.
  • Incident Management - act in key support roles during major incidents (e.g. Sev0, Sev1) and participate in the technical review of the incident for problem management
  • Problem Management - populate and participate in RCAs and hand them off to the Global Solutions team
  • Ensure that work carried out by the Site Reliability team complies with the company’s internal compliance policy and directives
  • Bring a passion for solving technical issues and customer concerns, working with other technical staff as required
  • Work with and lead other members of the team in staying on top of key industry innovation and technology, and assist in team development and growth
  • Operate in a fast-paced environment, solve sophisticated issues quickly, and successfully balance multiple priorities
  • Work to automate detection and resolution of recurring issues in the production environment
  • Help create and improve current processes to reduce operations and engineering toil

Benefits

  • time off programs
  • medical
  • dental
  • vision
  • mental health support
  • paid parental leave
  • life and disability insurance
  • 401(k)
  • employee stock purchasing program