Linux Systems Reliability Engineer

NutanixDurham, NC
$117,600 - $160,000Hybrid

About The Position

Are you a detail-oriented problem solver with a passion for optimizing cloud operations and a knack for writing efficient scripts? If so, you'll thrive in our dynamic Nutanix team, where your skills will directly impact the availability and performance of our cutting-edge cloud technology while collaborating with a global network of talented professionals dedicated to innovation and operational excellence. At Nutanix, you'll join the Private Cloud team within the broader Global Cloud Operations (GCO) team, a dynamic assembly of over 70 passionate individuals spread across the US, Netherlands, Serbia, and India. This diverse group thrives in a high-energy IT environment that prioritizes innovation and operational excellence. You'll experience a collaborative culture that fosters teamwork and a sense of unity, despite our geographical dispersion. The GCO team plays a crucial role in ensuring the smooth operation of critical systems, leveraging cutting-edge technologies and automation to achieve our goals. Our work setup is hybrid, requiring you to be on-site three days a week while giving you the flexibility to work remotely for the remaining days. Travel commitments are very minimal, as they are not a primary aspect of the role and will only occasionally arise as needed for collaboration or specific events.

Requirements

  • Proficiency in Linux/UNIX system administration with advanced troubleshooting skills.
  • Strong scripting capabilities in languages such as Python and Bash.
  • Experience in a 24/7 NOC environment, preferably with a cloud service provider.
  • Solid understanding of cloud infrastructure components (firewalls, load balancers, DNS, etc.).
  • Knowledge of cloud technologies and architectures, especially in SaaS environments.
  • Must have at least 3-5 years of hands-on experience with Nutanix AOS, AHV, and Prism Central, and VMware/Proxmox/KVM.
  • Excellent problem-solving abilities and strong communication skills.

Nice To Haves

  • Kubernetes is a big plus.
  • Experience with Kubernetes would be a strong plus.
  • A relevant degree in Computer Science, Information Technology, or a related field is preferred.

Responsibilities

  • Ensure the 24/7 availability and reliability of Nutanix's cloud services and infrastructure.
  • Respond promptly to alerts and support tickets, troubleshooting and resolving issues effectively.
  • Collaborate with QA, Development, and Infrastructure teams to design and implement robust monitoring solutions.
  • Manage deployment of software patches, upgrades, and administrative tools to maintain system integrity.
  • Participate in on-call rotation to provide after-hours support and maintain service level agreements (SLAs).
  • Develop and enhance automation scripts using languages like Python or Bash for operational efficiency.
  • Document processes and procedures for knowledge sharing and continuous improvement within the team.
  • Achieve first-year objectives by streamlining incident response processes and enhancing system monitoring capabilities.

Benefits

  • 401(k) eligibility
  • various paid time off benefits, such as vacation, sick time, and parental leave
  • sign-on bonus
  • restricted stock units
  • discretionary awards
  • full range of medical, financial, and/or other benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service