About The Position

We are looking for an Incident Manager to spearhead incident management operations for Wiz’s Federal and Sovereign Cloud environments. You will lead the program development, strategic response, and technical recovery efforts for high-impact events, ensuring every incident is met with a structured, compliant, and decisive resolution.

Requirements

  • 7+ years of experience leading crisis management and incident response programs in FedRAMP High, IL5, or NIST 800-53 environments.
  • Direct experience in managing and leading major incidents
  • Direct experience working cloud environments, AWS required (other clouds a plus)
  • Experience working with cloud native technologies like containers and container orchestration platforms like Kubernetes.
  • Ability to interpret metrics and logs in observability and security event management tools such as Grafana, Prometheus, DataDog, Splunk, etc.
  • Experience with incident management platforms such as PagerDuty, ServiceNow, or Jira, including experience building automated notification trees and dashboards.
  • Strategic thinking and a risk focused mindset on reliability improvements
  • Ability to identify systemic gaps that feed back into program design and operations teams
  • Strong writing and documentation skills to effectively communicate with both technical and business audiences
  • Ability to maintain composure and exercise sound judgement while navigating high-stake decision making during complex and ambiguous incidents
  • Candidates must meet EAR part 772 and ITAR 120.15 definition of a U.S. person (Any individual who is granted U.S. citizenship; or any individual who is granted U.S. permanent residence (green card holder); or any individual who is granted status as a “protected person”) and that they reside in the contiguous United States.

Responsibilities

  • Serve as the lead incident coordinator for high-severity events, activating playbooks, declaring incident severity, and coordinating with functional leads to drive a structured response.
  • Define, operationalize, and document the end-to-end incident response lifecycle that aligns to FedRAMP High, IL5, and NIST 800-53 requirements.
  • Drive readiness activities by designing and facilitating cross-functional tabletop exercises, hands-on simulations exercises, incident response team training, and review of playbooks to validate response protocols.
  • Facilitate Root Cause Analysis by leading post-incident reviews using structured methodologies and documentation to separate root causes from contributing factors and drive business-wide corrective actions to closure.
  • Serve as the primary liaison between technical and business units by translating incident details into business impact assessments that drive informed decision-making for legal, compliance, and operational teams.
  • Bridge technical and operational responses by building communication paths between engineering, operations, legal, compliance, and customer facing teams to translate complex incidents into actionable updates for leadership.
  • Establish centralized reporting, dashboards, and KPIs to monitor response efficiency, trend analysis, and program maturity.
  • Manage and optimize incident response tools like ServiceNow, PagerDuty, and Jira to ensure.

Benefits

  • Medical, dental and vision insurance
  • Home Office Setup reimbursement
  • Flexible Spending Accounts
  • Monthly Connectivity reimbursement
  • Employee Assistance Program (EAP)
  • Short- and Long-term Disability Insurance
  • Life & Accident Insurance
  • 401(k) Retirement Savings Plan (with employer match)
  • Flexible paid time off + 11 paid holidays
  • Paid leave programs, including parental, pregnancy health, medical and bereavement leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service