ServiceNow-posted 4 months ago
Orlando, FL
5,001-10,000 employees

This position is based in our Orlando, FL office. We are looking for an AI-native product leader with a strong background in network observability, cloud networking, or infrastructure platforms to drive the next generation of intelligent, compliance-aware observability for hyperscaler and sovereign cloud environments. This role goes beyond traditional monitoring — you will design and deliver capabilities that leverage AI/ML, LLMs, and agentic automation to: Predict network degradations before they impact services. Automate compliance validation for sovereign workloads. Orchestrate autonomous remediation workflows that minimize human intervention. You will own the end-to-end product lifecycle, from vision and architecture definition to adoption and continuous improvement, ensuring network observability evolves into a self-healing, compliance-assured platform.

  • Define and execute a multi-year AI-native network observability strategy spanning hyperscale and sovereign cloud deployments.
  • Establish a vision for predictive, automated observability — moving from reactive monitoring to autonomous, intelligence-driven operations.
  • Anticipate technology and regulatory changes (AI governance, sovereign cloud evolution, hyperscaler service updates) to keep the platform ahead of the curve.
  • Design and deliver AI/ML-powered telemetry pipelines that analyze high-volume network data (flow logs, packet captures, metrics) for anomalies and degradation patterns.
  • Embed LLM-based reasoning agents for root cause analysis, network impact assessment, and remediation recommendations.
  • Implement agentic AI workflows to detect compliance deviations (data residency, encryption policy breaches) and automate configuration drift detection and corrective actions.
  • Simulate network failure scenarios for resilience validation.
  • Integrate with hyperscaler-native capabilities, ensuring AI inference models work in both standard and sovereign cloud environments.
  • Build observability solutions that are compliance-aware by design, including sovereign-certified encryption and key management.
  • Ensure AI models are trained and executed in sovereign-compliant environments without violating jurisdictional restrictions.
  • Support air-gapped and low-connectivity deployments with edge AI inference capabilities.
  • Partner with SRE, cloud networking, compliance, and AI/ML engineering teams to deliver production-grade AI observability features.
  • Collaborate with governance teams to align AI observability outputs with audit, risk, and compliance frameworks.
  • Drive alignment with architecture and operations teams to standardize AI-driven operational playbooks.
  • Define AI performance KPIs such as prediction accuracy, false positive rates, automated resolution percentage, and compliance drift detection rate.
  • Continuously refine AI models using telemetry feedback loops, ensuring high accuracy and relevance.
  • Measure operational impact — reductions in MTTR, incident frequency, and compliance breach risk.
  • 10+ years in product management or platform ownership in networking, observability, or cloud infrastructure.
  • 3+ years of experience integrating AI/ML into products.
  • Deep understanding of network protocols, routing, telemetry standards (NetFlow, sFlow, IPFIX), and packet analysis.
  • Proven experience with hyperscaler networking architectures (AWS, Azure, GCP) and sovereign cloud environments.
  • Familiarity with AI/ML frameworks (TensorFlow, PyTorch) and LLM integration into operational systems.
  • Understanding of compliance frameworks (GDPR, FedRAMP, CMMC) and their implications on AI data handling.
  • Proficiency with observability stacks (OpenTelemetry, Prometheus, Grafana, Splunk) and automation tooling.
  • Experience designing AI-native observability platforms for regulated, multi-cloud environments.
  • Background in agentic AI workflows for infrastructure operations.
  • Expertise in air-gapped AI deployments and sovereign-compliant ML pipelines.
  • Track record of delivering autonomous network remediation systems.
  • Flexible work personas (flexible, remote, or required in office).
  • Equal opportunity employer.
  • Accommodations for candidates requiring assistance in the application process.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service