DevOps Engineer - AWS

TensorWave, Las Vegas, NV

About The Position

TensorWave is seeking an AWS Cloud Engineer to design, provision, optimize, and support the AWS infrastructure that powers its AMD GPU AI/HPC platform. This is a hands-on role in which you will collaborate with backend engineers, developers, SREs, and platform teams to keep the cloud infrastructure reliable, cost-efficient, and scalable. The primary objective is to minimize cloud bottlenecks and provide a robust foundation for engineering teams.

Requirements

  • 5+ years in cloud infrastructure, DevOps, SRE, or platform operations
  • Hands-on AWS experience: VPCs, EC2, S3, IAM, CloudWatch, Route 53, load balancers, security groups, private networking
  • Proficiency with IaC tooling (Terraform strongly preferred)
  • Strong Linux fundamentals — networking, process management, storage, troubleshooting
  • Experience with CI/CD, Git-based workflows, and monitoring/alerting platforms
  • Clear communicator who can document infrastructure and collaborate across engineering teams

Nice To Haves

  • Experience with AI/ML, GPU, or HPC workloads
  • Kubernetes on AWS (EKS or self-managed)
  • Observability platforms: Prometheus, Grafana, Loki, OpenTelemetry, Datadog
  • AWS cost optimization: right-sizing, savings plans, lifecycle policies, tagging
  • Startup or high-growth infrastructure environment background

Responsibilities

  • Own the full lifecycle of AWS infrastructure across dev, staging, production, and customer-facing environments — provisioning, scaling, monitoring, security, cost optimization, and decommissioning
  • Build and maintain Infrastructure-as-Code (Terraform, Pulumi, AWS CDK, CloudFormation)
  • Implement cloud patterns for high availability, auto-scaling, secure service communication, and customer environment provisioning
  • Build and maintain CI/CD workflows for cloud infrastructure and hosted services
  • Improve observability through metrics, logging, alerting, dashboards, and runbooks
  • Troubleshoot AWS networking, compute, storage, IAM, and deployment issues
  • Participate in incident response, post-incident reviews, and root cause analysis
  • Document architecture, operational processes, and best practices

Benefits

  • Stock Options
  • 100% paid Medical, Dental, and Vision insurance for Employees
  • Company Health Savings Account Contributions
  • 100% paid Short Term and Long Term Disability Insurance for Employees
  • Life and Voluntary Supplemental Insurance Options
  • Other Insurance Options, such as Pet & Legal Insurance
  • Various Supplementary Health Benefits, such as discounted Virtual Healthcare Appointments and Serious Illness Support
  • Flexible Spending Account
  • 401(k)
  • Employee Assistance Program
  • Flexible PTO
  • Paid Holidays
  • Parental Leave
  • Other In-Office Perks