About The Position

Triumph is looking for a hands-on, forward-thinking Lead DevOps Engineer to help scale and strengthen our cloud infrastructure. In this role, you’ll lead the design and evolution of secure, reliable, and high-performing systems while partnering closely with engineering, security, and operations teams. If you enjoy solving complex problems, improving systems at scale, and driving meaningful technical change, this is a great opportunity to make an impact. You’ll spend your day ensuring our cloud platforms run smoothly and efficiently optimizing Kubernetes clusters, improving CI/CD pipelines, and partnering with teams to deliver scalable, secure applications. From troubleshooting complex issues to rolling out automation and guiding infrastructure strategy, your work will directly impact performance and developer productivity across the organization.

Requirements

  • 5+ years of experience in DevOps, cloud engineering, or infrastructure roles
  • Strong hands-on experience with AWS services (EC2, S3, EKS, RDS, IAM, VPC, and more)
  • Deep expertise in Kubernetes and Helm
  • Experience with Terraform (Terragrunt is a plus) and CI/CD tools like Argo CD or GitHub Actions
  • Familiarity with Kafka, Redis, and Postgres in production environments
  • Experience managing MSSQL Server and performing database migrations
  • Solid understanding of networking in cloud and hybrid environments
  • Comfortable working in Linux environments
  • Experience supporting large-scale, multi-account AWS environments
  • Strong troubleshooting and problem-solving skills
  • Ability to communicate technical concepts clearly to different audiences
  • Experience working in Agile teams
  • Motivated to learn, grow, and pursue technical certifications

Nice To Haves

  • Familiarity with Snowflake or Looker is a plus

Responsibilities

  • Design, build, and maintain AWS-based cloud infrastructure, CI/CD pipelines, and automation tools
  • Partner with development teams to ensure applications are scalable, reliable, and secure
  • Optimize and manage Kubernetes clusters for performance, scalability, and consistency
  • Develop and maintain Helm charts for containerized applications
  • Manage and support Kafka clusters and streaming infrastructure
  • Collaborate with security teams to meet compliance standards (SOC2, SOX, FFIEC)
  • Monitor system performance using tools like Grafana, Loki, and other observability platforms
  • Automate deployments, configurations, and operational processes
  • Recommend and implement improvements to infrastructure design and DevOps practices
  • Define and implement SLOs and SLIs to enhance system reliability
  • Continuously improve DevOps workflows and platform efficiency

Benefits

  • Medical
  • Dental
  • Vision
  • Paid Time Off
  • 401k
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service