NTT DATA North America-posted 3 months ago
Plano, TX
5,001-10,000 employees

The Cloud Linux Engineer is responsible for advanced technical support, administration, and optimization of managed customer cloud environments spanning AWS, Azure, Google Cloud Platform (GCP), and Oracle Cloud Infrastructure (OCI). This position demands Linux OS expertise, experience in public cloud, a strong understanding of managed services operations, and a proactive, problem-solving outlook. The Linux Cloud Engineer will also participate in automation initiatives, incident and change management, and mentor junior team members.

  • Provide support for Linux-based systems across on-premises and cloud environments (AWS, Azure, GCP, OCI).
  • Support Customer Self-Provision cloud instances across AWS, Azure, GCP, and OCI with security guardrail and backend deployment.
  • Implement and maintain system monitoring, alerting, and logging solutions to ensure high availability and reliability.
  • Lead root cause analysis and document post-incident reviews for major Linux-related issues.
  • Execute patch management, OS and kernel upgrades, and regular system maintenance.
  • Develop and maintain backup, disaster recovery, and failover strategies for Linux infrastructure.
  • Participate in on-call rotation and after-hours support as required.
  • Develop and maintain automation scripts using PowerShell, Python, or Ansible to streamline system administration tasks.
  • Manage infrastructure using tools like Azure Automation.
  • Maintain Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.
  • Recommend and implement system optimization strategies for performance and resource utilization.
  • Enforce Linux system security best practices, including access control, encryption, and secure configurations.
  • Manage user access, sudo privileges, and SSH key policies in line with IAM standards.
  • Monitor, identify, and remediate security vulnerabilities reported by scanning tools or external advisories.
  • Support compliance initiatives by maintaining secure and auditable Linux environments.
  • Work closely with application, security, and network teams for solution delivery and support.
  • Mentor junior engineers and provide technical guidance as needed.
  • Create and update technical documentation, runbooks, and SOPs.
  • Participate in client calls to provide technical input when required.
  • Bachelor’s degree (or equivalent experience) in Computer Science, IT, Engineering, or a related field.
  • At least two of the following certifications (or equivalent experience): Red Hat Certified System Administrator (RHCSA), Red Hat Certified Engineer (RHCE), Microsoft Certified: Azure Administrator Associate or Solutions Architect Expert.
  • 7+ years of hands-on experience in Public Cloud Linux engineering, operations in a 24*7 production support model.
  • 3+ years multi-cloud experience (preferred to have hands-on in at least 2 of AWS/Azure/GCP/OCI).
  • Direct experience in managed services/NOC/SOC/MSP environments is a plus.
  • In-depth expertise in provisioning, configuring, securing, supporting, and optimizing Linux-based systems (RHEL, CentOS, Ubuntu, etc.) in enterprise environments.
  • Basic expertise with cloud-native and hybrid workloads in AWS, Azure, GCP, and/or OCI.
  • Strong experience in managing compute, storage, networking, and system services on Linux platforms.
  • Proficient in system architecture, deployment, performance tuning, and troubleshooting of Linux servers.
  • Skilled in scripting languages such as Bash, Python, and Perl for automation and system management.
  • Proficient in using ServiceNow ITSM for incident, change, and problem management.
  • Strong understanding of Linux-based backup strategies, disaster recovery planning, and high-availability configurations (e.g., Pacemaker, DRBD).
  • Familiar with security tools and practices including SELinux, iptables, auditd, and fail2ban.
  • Experience with vulnerability assessment and patch management using tools like Qualys, OpenSCAP, or Lynis.
  • Proficient in log analysis and system monitoring using tools such as Syslog, Logrotate, Nagios, Prometheus, and Grafana.
  • Familiar with endpoint protection and threat detection tools such as CrowdStrike and OSSEC.
  • Strong knowledge of user access control, SSH key management, and secure file transfer protocols.
  • Ability to troubleshoot Linux services such as Apache, Nginx, MySQL, PostgreSQL, and Samba.
  • Maintaining Infrastructure as Code (IaC) templates using tools such as Terraform, CloudFormation, ARM, or OCI Resource Manager.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service