NVIDIA is looking for a skilled and motivated Senior Site Reliability Engineer (SRE) to join our team in Santa Clara, CA. This role combines real-time incident leadership with hands-on engineering, focused on improving how we detect, respond to, and prevent issues at scale. You will operate as an Incident Commander during critical events while building the systems, automation, and observability that reduce operational toil and improve reliability over time.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior