Site Reliability Engineering (SRE) is an engineering discipline that combines software development and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for the availability and reliability of our firm's most critical platform services and ensures they meet the requirements of our internal and external users. We also develop and operate the observability platforms that all other engineering teams use to make their services reliable. We look for engineers who are motivated to collaborate with other engineering teams and our businesses to build and run sustainable production systems, which can evolve and adapt to changes in our fast-paced, global business and regulatory environment.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Entry Level
Number of Employees
5,001-10,000 employees