DevOps Manager

HashgraphDallas, TX
1dRemote

About The Position

Hashgraph is seeking an experienced DevOps Manager to lead our DevOps team in supporting the operations of consensus nodes across Hedera testnet, previewnet, and preproduction environments. This role requires a hands-on technical leader who can balance strategic planning with day-to-day operational excellence in our web3 infrastructure. As the DevOps Manager, you will lead a team of operations engineers while remaining technically engaged in building automation, improving infrastructure as code, and coordinating with Hedera Governing Council members. You'll be responsible for team development, process optimization, and ensuring 24/7 operational readiness of critical Hedera network infrastructure. This role requires strong technical expertise in cloud infrastructure (particularly GCP), infrastructure as code tools (Terraform, Ansible), and container orchestration (Kubernetes), combined with proven people management skills to mentor, grow, and retain top engineering talent.

Requirements

  • B.S in Computer Science or a similar study
  • 3+ years of people management experience leading DevOps or infrastructure engineering teams
  • 7+ years of DevOps or software development experience
  • 5+ years of experience running AWS / GCP / Azure cloud workloads at scale
  • Strong hands-on experience with Terraform, Kubernetes, and Ansible
  • Deeply familiar with operating and troubleshooting issues in a Linux environment
  • Proven track record of building high-performing teams and developing engineering talent
  • Experience with incident management, on-call rotations, and post-mortem processes
  • Deeply familiar with DevOps and software development lifecycle best practices
  • Strong written and verbal communication skills, including the ability to interface with senior leadership
  • Comfortable leading a fully remote, distributed team across multiple time zones

Nice To Haves

  • Experience in blockchain, web3, or distributed systems operations
  • Familiarity with the LGTM stack and observability best practices
  • Programming experience in Golang, Python, Bash, Java, or JavaScript
  • Experience with Jenkins Pipelines, Github, and Github Actions
  • Background in SRE principles and practices

Responsibilities

  • Lead and mentor a team of DevOps engineers, providing technical guidance and career development
  • Manage day-to-day operations of Hedera production and preproduction infrastructure
  • Coordinate with Hedera Governing Council members on operational matters and infrastructure requirements
  • Design and implement automation solutions to reduce operational toil and improve efficiency
  • Own and evolve infrastructure as code practices using Terraform and Ansible
  • Establish and maintain incident management processes, including on-call rotations and post-mortem reviews
  • Drive continuous improvement initiatives for monitoring, observability, and alerting systems
  • Manage capacity planning and scaling strategies for cloud and bare metal infrastructure
  • Ensure 24/7 operational readiness and lead response to critical incidents
  • Lead hiring efforts to grow the DevOps team, including defining role requirements, interviewing candidates, and making hiring decisions
  • Collaborate with development teams to improve CI/CD pipelines and deployment processes
  • Define and track team KPIs, SLOs, and operational metrics
  • Manage team budget and resource allocation
  • Interface with senior leadership on strategic planning and technical roadmap
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service