At NVIDIA, we pride ourselves on data-driven decision-making, and the data science platform team is at the heart of this initiative. NVIDIA runs some of the most demanding AI, data, and platform workloads on the planet, and none of it works without a reliable, high-scale observability foundation. We’re hiring an Engineering Manager to lead the team that builds and operates NVIDIA’s global observability platform: the system that carries every metric, log, trace, profile, and event our engineers rely on to understand and debug their services.

This isn’t a traditional people-manager role. You’ll stay close to the technology, guide architecture decisions, review designs and code, and help the team solve real distributed-systems challenges. You’ll work with engineers to shape how services instrument themselves, how we ingest and store high-cardinality telemetry, and how observability fits cleanly into NVIDIA’s broader platform ecosystem. You’ll partner directly with platform, infrastructure, and application teams to evolve how telemetry flows across metrics, logs, traces, profiling, and events. You’ll coach and mentor engineers, build strong technical habits, and drive a roadmap that keeps the platform reliable and ready for NVIDIA’s rapid growth.

If you enjoy deep technical work, high-throughput pipelines, open-source observability stacks, and helping engineers do the best work of their careers, this role is built for you.
Job Type: Full-time
Career Level: Manager
Number of Employees: 5,001-10,000