Director, Observability Platform Engineering Technical Lead

Fidelity Investments, Durham, NC
Hybrid

About The Position

We are seeking a highly experienced, hands-on Technical Lead and Build Owner to lead a dedicated team of software engineers responsible for delivering core platform capabilities within Fidelity’s Enterprise Observability Platform. This is not an SRE role; instead, you will focus on building and evolving the observability platform that Site Reliability Engineering and development teams depend on.

In this role, you will define and drive the Observability Integrations roadmap, emphasizing scalable automation, security-by-design, and enterprise readiness across a complex hybrid–multi-cloud environment. You will lead the design, development, and support of enterprise-grade observability integrations with SaaS solutions such as Datadog, as well as open-source frameworks including OpenTelemetry (OTel) and Prometheus.

Requirements

  • Bachelor’s degree in a technology-related field (Computer Science, Engineering, etc.) or equivalent experience.
  • Extensive hands-on engineering experience with Java, Go, and/or Python.
  • Deep engineering experience with commercial and open-source observability platforms, including agent lifecycle management, agent release processes, platform governance, and FinOps best practices.
  • Experience designing, enabling, and managing observability capabilities across diverse technology stacks at enterprise scale.
  • Strong understanding of observability patterns and practices, including distributed tracing, metrics and logs pipelines, synthetics, and Real User Monitoring (browser and mobile).
  • Demonstrated ability to coach, mentor, and lead engineering teams to build scalable, resilient platform solutions.
  • Strategic, forward-thinking mindset with a strong ability to identify patterns, simplify architectures, and create long-term platform value.
  • Ability to define a platform roadmap that blends automation, usability, security, and cost efficiency.
  • Deep expertise in building and integrating security controls in public cloud environments.
  • Strong understanding of modern IT service management practices and enterprise technology landscapes, including cloud delivery models (IaaS, PaaS, SaaS); automation frameworks; container platforms; auto-scaling and compute orchestration; networking, storage, and identity/access management; configuration, incident, problem, and asset management; and logging, auditing, and compliance frameworks.
  • Passion for technology and for delivering platform solutions that solve real business problems using cloud‑native architectures.
  • Ability to work across organizational boundaries and communicate effectively with technical and non-technical stakeholders.

Nice To Haves

  • Experience with agentic AI is a plus.

Responsibilities

  • Lead the design, development, and support of enterprise-grade observability integrations with SaaS solutions such as Datadog, as well as open-source frameworks including OpenTelemetry (OTel) and Prometheus.
  • Define and drive the Observability Integrations roadmap, emphasizing scalable automation, security-by-design, and enterprise readiness across a complex hybrid–multi-cloud environment.
  • Lead a dedicated team of software engineers responsible for delivering core platform capabilities within Fidelity’s Enterprise Observability Platform.