The Observability Engineering organization at CoreWeave is responsible for the platforms and practices that help engineers understand, operate, and improve production systems at scale. This team owns and evolves the foundations for metrics, logs, traces, telemetry pipelines, and observability reliability, enabling teams to detect issues quickly, troubleshoot complex distributed systems, and operate AI infrastructure with confidence. As CoreWeave continues to scale, observability plays a critical role in delivering reliable platform experiences, improving engineering velocity, and maintaining operational excellence across a rapidly growing cloud environment.

CoreWeave is seeking a Senior Manager, Observability Engineering to lead a team responsible for building, scaling, and operating observability systems across metrics, logs, traces, and telemetry pipelines. In this role, you will define strategy and roadmap, drive platform reliability and performance improvements, and guide architectural decisions across observability infrastructure. You will partner closely with infrastructure, platform, security, and application engineering teams to improve instrumentation and production visibility. This role combines technical leadership, operational ownership, and team management to ensure observability platforms scale with business and customer needs.
Job Type: Full-time
Career Level: Senior
Education Level: No Education Listed
Number of Employees: 251-500 employees