Observability Engineer

Eleventh Hire IncIrving, TX
306d

About The Position

The position is for a leadership role within a distributed Monitoring, Observability, IT operations, DevOps, or SRE group at an American casino and resort company headquartered in Las Vegas, Nevada. The candidate should have over 5 years of demonstrated experience in these areas, with a strong emphasis on on-premises IT infrastructure, applications, and both private and public cloud monitoring. The role involves working closely with the Central Head of Operations to determine and execute priorities for monitoring, alerting, and observability KPIs that are essential for the organization.

Requirements

  • 5+ years of experience in leading distributed Monitoring, Observability, IT operations, DevOps, or SRE groups.
  • Strong expertise in on-prem IT infrastructure.
  • Proficient in Linux/Unix and container orchestration (e.g., Kubernetes).
  • Experience with ITRS is a plus.
  • Strong scripting skills in Python, Java, and RESTful Services.
  • Technical acumen in Cloud Architecture, Performance Benchmarking, and Capacity planning.
  • Experience with tools like Harness, GitLab, Terraform, Ansible, or CloudFormation.

Nice To Haves

  • Experience in diagnosing performance bottlenecks using observability data.
  • A unique hybrid of experience in both network and new-wave SRE.

Responsibilities

  • Lead distributed Monitoring, Observability, IT operations, DevOps, or SRE groups.
  • Work with Central Head of Operations to decide and execute upon priorities for monitoring, alerting, and observability KPIs.
  • Diagnose performance bottlenecks and other system issues using observability data.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service