Senior AWS DevOps Engineer

Growth Acceleration Partners
Colorado Springs, CO

About The Position

We are looking for a Cloud / Kubernetes Engineer to support hybrid infrastructure across AWS and on-premises Kubernetes environments. This role blends foundational AWS execution (Level 1) with operational ownership of Kubernetes clusters (Level 2), including SUSE-based distributions and Rancher-managed environments. You will contribute to production reliability, cluster lifecycle management, and hybrid cloud connectivity, helping keep infrastructure scalable, secure, and resilient across environments. The role suits an engineer who thrives in hands-on operational work and enjoys troubleshooting complex infrastructure systems.

Requirements

  • Bachelor’s degree in Computer Science, Information Systems, or equivalent practical experience
  • 2–4 years of hands-on Kubernetes experience
  • 1–3 years of AWS infrastructure experience
  • 2+ years administering Linux systems (SUSE preferred)
  • Experience working in production environments with on-call participation
  • IAM (roles, policies, STS)
  • VPC fundamentals
  • EC2, S3, RDS, Lambda
  • AWS CLI
  • Terraform
  • Pods, Deployments, StatefulSets
  • Ingress
  • RBAC
  • Storage classes & persistent volumes
  • Node lifecycle management
  • Production incident troubleshooting
  • Cluster upgrades
  • SUSE Linux Enterprise Server (SLES)
  • zypper package management
  • Networking fundamentals
  • Linux security fundamentals
  • Strong troubleshooting mindset
  • Comfortable in hybrid cloud environments
  • Clear documentation habits
  • Strong communication skills
  • Reliable in on-call rotations

Nice To Haves

  • Experience with EKS
  • etcd administration
  • GitOps practices
  • Container security tooling
  • Monitoring stacks (Prometheus, Grafana, ELK)

Responsibilities

  • Support multi-account AWS environments using AWS Organizations
  • Assist with VPC configuration (subnets, routing tables, NAT, IGW, security groups)
  • Deploy and maintain EC2, IAM roles, S3, RDS, Lambda, and CloudWatch resources
  • Troubleshoot host-level issues and IAM permission challenges
  • Execute infrastructure changes via Terraform
  • Contribute to observability and monitoring (LogicMonitor, Datadog, Prometheus)
  • Participate in on-call rotation
  • Perform routine governance, risk, and compliance (GRC) tasks
  • Operate and maintain production on-prem Kubernetes clusters
  • Manage clusters provisioned and governed through Rancher
  • Perform lifecycle management (node provisioning, upgrades, patching)
  • Troubleshoot control plane and worker node issues
  • Monitor etcd health and cluster resiliency
  • Configure ingress controllers
  • Implement RBAC and namespace governance
  • Perform rolling upgrades with minimal disruption
  • Manage Deployments, StatefulSets, DaemonSets, and Services
  • Deploy and manage Helm-based applications
  • Configure persistent storage (CSI drivers, storage classes)
  • Integrate container registries
  • Support cluster security hardening
  • Implement monitoring and logging integrations
  • Support hybrid connectivity between on-prem and AWS
  • Administer SUSE Linux Enterprise Server (SLES)
  • Perform system patching and kernel updates
  • Manage networking and systemd services
  • Troubleshoot OS-level performance issues
  • Support capacity planning for on-prem clusters
  • Contribute to Infrastructure-as-Code using Terraform
  • Automate operational tasks using Python or Bash
  • Maintain CI/CD pipelines (Jenkins, GitHub Actions)
  • Document runbooks and operational procedures