Senior DevOps Engineer

TheIncLab
Nashville, TN
23h

About The Position

The Mission Starts Here

TheIncLab engineers and delivers intelligent digital applications and platforms that revolutionize how our customers and mission-critical teams achieve success.

Your Mission, Should You Choose to Accept

We’re looking for a Senior DevOps Engineer who is passionate about automation, operational excellence, and building reliable, scalable infrastructure. This role blends core DevOps responsibilities with a strong emphasis on Site Reliability Engineering (SRE), helping ensure system uptime, performance, and observability across environments. The ideal candidate brings hands-on experience managing CI/CD pipelines, cloud infrastructure, and production operations, with a mindset oriented toward reducing toil and driving continuous improvement.

What will you do?

DevOps Engineering

  • Build, maintain, and improve CI/CD pipelines using GitLab CI/CD or similar tools.
  • Automate infrastructure provisioning, deployment, and maintenance using Terraform, Ansible, or related technologies.
  • Collaborate with developers and QA to create reliable deployment paths from local dev to production.
  • Implement infrastructure-as-code practices across environments (e.g., AWS, Kubernetes, bare-metal).

Site Reliability Engineering (SRE)

  • Design and implement monitoring, alerting, and observability systems to maintain high availability and performance.
  • Respond to incidents, lead root cause analysis, and implement preventive measures.
  • Establish and evolve SLOs/SLIs to ensure measurable system reliability.
  • Participate in on-call rotation and help build automation to reduce the need for human intervention.
  • Drive capacity planning, performance tuning, and cost optimization initiatives.

System Operations

  • Administer Linux (Ubuntu/Debian) and Windows-based infrastructure.
  • Manage self-hosted GitLab instances and ensure secure, performant operation.
  • Implement and enforce security best practices across infrastructure (IAM, RBAC, least privilege, etc.).
  • Support both containerized and virtualized workloads across environments.

Requirements

  • 5+ years in DevOps, SRE, or Infrastructure Engineering roles.
  • Hands-on proficiency with AWS services (EC2, S3, RDS, VPC, IAM, etc.).
  • Experience deploying and managing infrastructure using Terraform and/or Ansible.
  • Solid knowledge of Linux system administration.
  • Strong Windows system administration skills.
  • Proven experience managing and automating GitLab, including CI/CD pipelines.
  • Proficiency in at least one programming or scripting language (Python, Bash, etc.).
  • Experience implementing monitoring, logging, and alerting solutions (CloudWatch, Datadog, CloudTrail).
  • Solid understanding of networking, security best practices, and high-availability system design.
  • Familiarity with version control systems (Git) and GitLab workflows.
  • Strong troubleshooting and incident response skills, with a focus on automation and root cause analysis.
  • Ability to travel up to 20%.
  • Applicants must be U.S. citizens who are willing and eligible to obtain a U.S. security clearance at the Secret or Top Secret level.

Nice To Haves

  • AWS certification or equivalent practical experience.
  • Knowledge of cloud cost optimization and efficiency practices.
  • Experience with self-hosted GitLab instances and CI/CD pipelines.
  • Existing clearance is preferred.

Responsibilities

  • Build, maintain, and improve CI/CD pipelines using GitLab CI/CD or similar tools.
  • Automate infrastructure provisioning, deployment, and maintenance using Terraform, Ansible, or related technologies.
  • Collaborate with developers and QA to create reliable deployment paths from local dev to production.
  • Implement infrastructure-as-code practices across environments (e.g., AWS, Kubernetes, bare-metal).
  • Design and implement monitoring, alerting, and observability systems to maintain high availability and performance.
  • Respond to incidents, lead root cause analysis, and implement preventive measures.
  • Establish and evolve SLOs/SLIs to ensure measurable system reliability.
  • Participate in on-call rotation and help build automation to reduce the need for human intervention.
  • Drive capacity planning, performance tuning, and cost optimization initiatives.
  • Administer Linux (Ubuntu/Debian) and Windows-based infrastructure.
  • Manage self-hosted GitLab instances and ensure secure, performant operation.
  • Implement and enforce security best practices across infrastructure (IAM, RBAC, least privilege, etc.).
  • Support both containerized and virtualized workloads across environments.

Benefits

  • Hybrid and flexible work schedules
  • Professional development programs
  • Training and certification reimbursement
  • Extended and floating holiday schedule
  • Paid time off and paid volunteer time
  • Health and wellness benefits, including options for medical, dental, and vision insurance, along with access to wellness, mental health, and employee assistance programs
  • 100% company-paid benefits that include STD, LTD, and basic life insurance
  • 401(k) Plan Options with employer matching
  • Incentive bonuses for eligible clearances, performance, and employee referrals.
  • A company culture that values your individual strengths, career goals, and contributions to the team.