Platform Operations 2

HDROmaha, NE
Onsite

About The Position

At HDR, our employee-owners are fully engaged in creating a welcoming environment where each of us is valued and respected, a place where everyone is empowered to bring their authentic selves and novel ideas to work every day. As we foster a culture of inclusion throughout our company and within our communities, we constantly ask ourselves: What is our impact on the world? Watch Our Story:' https://www.hdrinc.com/our-story'

Requirements

  • Bachelor’s degree in Information Technology, Computer Science, Engineering, or related field, or equivalent experience.
  • Minimum 3 years of experience in infrastructure monitoring, systems operations, platform support, or operational engineering.
  • Experience with observability or monitoring platforms in enterprise infrastructure environments.
  • Experience supporting incident management processes and operational escalations.
  • Working knowledge of VMware vSphere and virtual infrastructure concepts.
  • Experience defining or reporting on service reliability metrics, SLAs, or SLOs.
  • Working knowledge of scripting for automation and data collection.

Nice To Haves

  • Experience with VMware Cloud Foundation Operations / vRealize Operations.
  • Experience integrating observability platforms with Dynatrace, ServiceNow, or similar enterprise tools.
  • Familiarity with VMware Aria Operations, Aria Operations for Logs, or related VCF ecosystem tools.
  • Exposure to cloud operations in Azure or other public cloud environments.
  • Familiarity with security operations concepts, least privilege, audit logging, and compliance evidence collection.
  • ITIL Foundation or similar service management certification.

Responsibilities

  • Build and maintain dashboards, alerts, health checks, and service views for VCF platform operations.
  • Define, track, and report on SLOs, SLAs, and operational health indicators for core platform services.
  • Act as an escalation point for platform degradation, recurring alerts, and service incidents.
  • Investigate incidents, correlate telemetry across tools, and coordinate resolution with infrastructure and platform teams.
  • Tune alert thresholds and reduce noise through event correlation, dependency awareness, and operational feedback.
  • Support integration of VCF Operations with Dynatrace, ServiceNow, and other enterprise operations platforms.
  • Conduct post-incident reviews and help drive remediation tasks that improve platform stability and supportability.
  • Establish and refine performance baselines, threshold models, and capacity trending reports.
  • Contribute to automation of operational checks, alert enrichment, reporting, and remediation workflows.
  • Apply established cloud security and compliance requirements in monitoring, operational reporting, and access practices.
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service