About The Position

This is a high-impact leadership opportunity to shape how enterprise technology operations evolve in the age of AI. The AVP will influence strategy across observability, resilience, incident intelligence, and automation at enterprise scale, while working on some of the most important challenges in modern operations: reducing noise, accelerating recovery, improving customer stability, and enabling safe, governed adoption of AI in production operations. This leader will play a central role in advancing intelligent triage, summarization, root cause hypothesis generation, runbook automation, and self-healing capabilities in a controlled enterprise environment.

Requirements

  • Undergraduate degree or Technical Certificate
  • 15+ years of development and technology delivery experience
  • Agile Delivery Experience preferred
  • Strong experience with APM, Event management, Operational Automation and Reporting Platforms Including Dynatrace, Datadog, Splunk PagerDuty, RunDeck, PowerBI.
  • Demonstrated ability to influence senior stakeholders, lead through ambiguity, align teams to measurable outcomes, and create a culture of accountability, innovation, and continuous improvement.

Responsibilities

  • Define and execute the enterprise strategy for Observability, AIOps, SRE, and incident intelligence.
  • Lead global teams responsible for the platforms, practices, and product roadmap that enable 24x7 operational visibility and resilience across critical technology services.
  • Drive the next phase of transformation from reactive monitoring to predictive, AI-assisted operations by advancing telemetry standards, improving signal quality, and scaling automation across the incident lifecycle.
  • Partner closely with engineering, platform, and application teams to embed observability into design, build, and run practices, while also leading vendor strategy, platform rationalization, and cost optimization.
  • Build trusted partnerships across engineering, infrastructure, architecture, risk, and control functions to deliver secure, stable, and scalable operational outcomes.
  • Establish governance and guardrails for AI and LLM-enabled operational workflows, with a strong focus on explainability, auditability, resiliency, and data protection.
  • Improve service availability and reduce customer-impacting outages.
  • Enable faster recovery and proactive issue prevention.
  • Enhance reliability of digital and core banking platforms.

Benefits

  • health and well-being benefits
  • savings and retirement programs
  • paid time off (including Vacation PTO, Flex PTO, and Holiday PTO)
  • banking benefits and discounts
  • career development
  • reward and recognition
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service