Engineering Manager, Box Office Platform

CoreWeaveSunnyvale, CA
8h$143,000 - $210,000Hybrid

About The Position

The Box Office Platform team designs and builds the software that automates CoreWeave’s hardware asset tracking and in-house repair and hardware RMA processes. We develop back-end automation in golang for interfacing with Jira and diverse vendor ticketing systems, and also build user interfaces using React/Typescript for managing and assigning in-house hardware repair tasks to be performed by Datacenter Technicians. We are seeking an Engineering Manager for the Box Office Platform team who can manage and grow our team of developers. This individual will manage software development projects, communicate with customers and peer teams and drive reliability, results and customer satisfaction. As the manager of this team, you would have the opportunity to: Lead and grow a team of talented and dedicated golang and React/Typescript full-stack developers Manage the development and support of software products for hardware RMA tracking with vendors as well as in-house repair and maintenance operations. Oversee investments in global observability, operational health measurement, runbook development and maintenance, and automated remediation. Lead efforts to maintain and enhance existing software products, soliciting user feedback and prioritizing fixes to deliver high-quality software and responsiveness to users. Scope, define, and lead new project development aimed at increasing the efficiency and scaling of CoreWeave’s nodes under management. Develop a program of onboarding, documentation, enablement, and performance management to help your team members achieve new heights of personal growth and capability. Drive the culture and tone for and support each other and how they enable the rest of CoreWeave.

Requirements

  • 10+ years of experience in infrastructure engineering; at least 5 years in leadership roles managing mission-critical operations
  • Excellent cross-functional communicator and collaborator
  • Strong process-oriented mindset and ability to document, scale, and optimize complex workflows
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or equivalent experience.
  • You have a background that includes the knowledge and practice of SRE fundamentals, incident management, blameless culture, observability, and change management.
  • You’re comfortable with the idea of building and leading a team of high-performing, diverse engineers.
  • You believe in the value of automation and will champion practices that drive reliability and the adoption of cross-team processes and tooling.
  • You love helping people on their journeys to become their best selves and are comfortable extending the range of your influence to partners, peers, and senior leadership.

Nice To Haves

  • Experience managing global fleets of GPU/dense compute infrastructure
  • Familiarity with data center systems (RMA processes, racks, HVAC, networking)
  • Expertise in observability platforms (Prometheus, Grafana, alerting systems), and orchestration automation
  • Prior roles owning uptime, incident response, or reliability engineering in hyperscale environments

Responsibilities

  • Lead and grow a team of talented and dedicated golang and React/Typescript full-stack developers
  • Manage the development and support of software products for hardware RMA tracking with vendors as well as in-house repair and maintenance operations.
  • Oversee investments in global observability, operational health measurement, runbook development and maintenance, and automated remediation.
  • Lead efforts to maintain and enhance existing software products, soliciting user feedback and prioritizing fixes to deliver high-quality software and responsiveness to users.
  • Scope, define, and lead new project development aimed at increasing the efficiency and scaling of CoreWeave’s nodes under management.
  • Develop a program of onboarding, documentation, enablement, and performance management to help your team members achieve new heights of personal growth and capability.
  • Drive the culture and tone for and support each other and how they enable the rest of CoreWeave.

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service