Sr. Incident Manager

DoubleVerifyNew York, NY
1d

About The Position

The Senior Incident Manager leads DoubleVerify’s Major Incident Management (MIM) program, owning the end-to-end lifecycle of critical incidents—from detection and response through business impact assessment, communication, and post-incident improvement. This is a high-impact individual contributor role responsible for driving structured, cross-functional incident response across Engineering, Product, Commercial, and Executive teams. The role combines technical understanding with strong business judgment to minimize customer impact, protect revenue, and ensure clear decision-making during high-pressure situations.

Requirements

  • 7+ years in SRE, DevOps, Technical Operations, or Incident Management
  • Experience leading Sev1/Sev2 incidents in high-availability environments
  • Proven ability to coordinate cross-functional teams during critical outages
  • Solid understanding of distributed systems and cloud environments (AWS or GCP)
  • Experience with monitoring and incident tools (e.g., Datadog, Grafana, PagerDuty)
  • Comfortable working with logs, alerts, and system diagnostics
  • Strong communicator, including with executive stakeholders
  • Ability to translate technical issues into business impact
  • Comfortable driving decisions and alignment under pressure
  • Calm, decisive, and execution-focused
  • Able to push for action and maintain momentum
  • Experience improving processes and operational maturity

Nice To Haves

  • Experience in AdTech, digital media, or similar environments
  • Familiarity with SLOs/SLIs and reliability frameworks
  • Experience with automation or AI-driven incident tooling
  • ITIL or similar certification

Responsibilities

  • Lead Sev1–Sev3 incidents as the single point of accountability
  • Drive real-time decision-making, escalation, and coordination across teams
  • Run incident communications, including updates to executives and stakeholders
  • Translate technical issues into clear business impact
  • Own and improve the Major Incident Management process
  • Lead post-incident reviews and ensure follow-through on actions
  • Track key metrics (e.g., MTTR, incident trends) and drive improvements
  • Coordinate with Product, Commercial, and Legal on client communications when needed
  • Align incident response with business priorities, including customer and revenue impact
  • Improve tooling, automation, and workflows for incident response

Benefits

  • This role will also be eligible for bonus, equity, and benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service