Public Cloud Linux Engineer

NTT AmericaPlano, TX
68dHybrid

About The Position

The Cloud Linux Operations & Support role involves providing support for Linux-based systems across on-premises and cloud environments, including AWS, Azure, GCP, and OCI. The position requires supporting customer self-provisioning of cloud instances with security guardrails and backend deployment. Responsibilities include implementing and maintaining system monitoring, alerting, and logging solutions to ensure high availability and reliability, leading root cause analysis, executing patch management, and developing backup and disaster recovery strategies. The role also includes participating in on-call rotation and after-hours support as required.

Requirements

  • Bachelor's degree (or equivalent experience) in Computer Science, IT, Engineering, or a related field.
  • Red Hat Certified System Administrator (RHCSA).
  • Red Hat Certified Engineer (RHCE).
  • Microsoft Certified: Azure Administrator Associate or Solutions Architect Expert.
  • 7+ years of hands-on experience in Public Cloud Linux engineering, operations in a 24*7 production support model.
  • 3+ years multi-cloud experience (preferred to have hands-on in at least 2 of AWS/Azure/GCP/OCI).
  • Direct experience in managed services/NOC/SOC/MSP environments is a plus.
  • In-depth expertise in provisioning, configuring, securing, supporting, and optimizing Linux-based systems (RHEL, CentOS, Ubuntu, etc.) in enterprise environments.
  • Basic expertise with cloud-native and hybrid workloads in AWS, Azure, GCP, and/or OCI.
  • Strong experience in managing compute, storage, networking, and system services on Linux platforms.
  • Proficient in system architecture, deployment, performance tuning, and troubleshooting of Linux servers.
  • Skilled in scripting languages such as Bash, Python, and Perl for automation and system management.
  • Proficient in using ServiceNow ITSM for incident, change, and problem management.
  • Strong understanding of Linux-based backup strategies, disaster recovery planning, and high-availability configurations (e.g., Pacemaker, DRBD).
  • Familiar with security tools and practices including SELinux, iptables, auditd, and fail2ban.
  • Experience with vulnerability assessment and patch management using tools like Qualys, OpenSCAP, or Lynis.
  • Proficient in log analysis and system monitoring using tools such as Syslog, Logrotate, Nagios, Prometheus, and Grafana.
  • Familiar with endpoint protection and threat detection tools such as CrowdStrike and OSSEC.
  • Strong knowledge of user access control, SSH key management, and secure file transfer protocols.
  • Ability to troubleshoot Linux services such as Apache, Nginx, MySQL, PostgreSQL, and Samba.

Nice To Haves

  • Maintaining Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.

Responsibilities

  • Provide support for Linux-based systems across on-premises and cloud environments (AWS, Azure, GCP, OCI).
  • Support customer self-provisioning of cloud instances across AWS, Azure, GCP, and OCI with security guardrails and backend deployment.
  • Implement and maintain system monitoring, alerting, and logging solutions to ensure high availability and reliability.
  • Lead root cause analysis and document post-incident reviews for major Linux-related issues.
  • Execute patch management, OS and kernel upgrades, and regular system maintenance.
  • Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure.
  • Participate in on-call rotation and after-hours support as required.
  • Develop and maintain automation scripts using PowerShell, Python, or Ansible to streamline system administration tasks.
  • Manage infrastructure using tools like Azure Automation.
  • Maintain Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.
  • Recommend and implement system optimization strategies for performance and resource utilization.
  • Enforce Linux system security best practices, including access control, encryption, and secure configurations.
  • Manage user access, sudo privileges, and SSH key policies in line with IAM standards.
  • Monitor, identify, and remediate security vulnerabilities reported by scanning tools or external advisories.
  • Support compliance initiatives by maintaining secure and auditable Linux environments.
  • Work closely with application, security, and network teams for solution delivery and support.
  • Mentor junior engineers and provide technical guidance as needed.
  • Create and update technical documentation, runbooks, and SOPs.
  • Participate in client calls to provide technical input when required.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Industry

Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services

Education Level

Bachelor's degree

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service