Director of Software Engineering, Observability

Core WeaveNew York, NY
72d$206,000 - $303,000Hybrid

About The Position

CoreWeave is seeking a Director of Engineering to lead the development of our Observability product suite for AI/ML workloads. In this highly technical and strategic role, you will lead a world-class team to design, build, and operate observability solutions at scale. You'll collaborate closely with Product and Engineering to ensure our Observability delivers a unified experience across CoreWeave products.

Requirements

  • 10+ years of experience in infrastructure, or cloud systems.
  • 5+ years in engineering leadership roles, including hiring, scaling, and mentoring teams.
  • Proven track record of building and managing large-scale distributed systems or infrastructure.
  • Strong communication and interpersonal skills, able to convey storage engineering strategies and practices to technical and non-technical audiences.
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field.

Nice To Haves

  • Prior experience in building telemetry solutions, such as logging, metrics and tracing.
  • Understanding of cloud computing infrastructure using Kubernetes.
  • You've worked at a cloud provider or hyperscaler and understand the scale and complexity.
  • You're familiar with telemetry, or multi-tenant compute environments.
  • You've scaled infrastructure for cloud services in production.
  • You're an expert at balancing cost, performance, and reliability in high-demand systems.

Responsibilities

  • Define and drive CoreWeave's Observability roadmap and strategy.
  • Lead and grow a high-performing team of software engineers and managers.
  • Design and implement advanced solutions, including low-latency, high-scale Observability pipelines across all products.
  • Build solutions that offer insights to customers for rapid troubleshooting of their AI workloads.
  • Champion initiatives to improve reliability, durability, and self-healing capabilities of Observability metrics, and assume operational responsibilities.
  • Develop operational review practices to assess performance against targets and iterating on those targets.
  • Mentor and guide engineering teams on best practices in product engineering, fostering a customer-focused approach to systems design and technical excellence.

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Professional, Scientific, and Technical Services

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service