Windows Cloud Lead Engineer

NTT DATAAustin, TX
11d$128,000 - $146,170

About The Position

Lead, mentor, and develop a team of cloud Windows/Linux engineers and administrators. Allocate resources, manage schedules, and ensure adherence to SLAs and KPIs. Conduct regular team meetings, performance reviews, and training sessions. Cloud Operations Management Provide oversight for the daily operations, incident management, change management, and problem resolution across customer cloud environments. Supervise deployment, maintenance, and optimization of cloud infrastructure following best practices and security standards. Ensure monitoring, backup, patching, and disaster recovery configurations are properly implemented. Conduct root cause analyses for critical incidents and drive corrective/preventive actions. Customer/Stakeholder Engagement Act as escalation point for customers during incidents or critical issues. Participate in client meetings, provide technical guidance, and support cloud solution discussions. Prepare and present operational reports, incident summaries, and optimization recommendations. Automation & Continuous Improvement Oversee the development and implementation of automation tools/scripts for provisioning, configuration management, monitoring, and reporting. Identify areas for operational improvement and cost optimization.

Requirements

  • Bachelor's degree (or equivalent experience) in Computer Science, IT, Engineering, or a related field.
  • Primary: Windows Secondary: Linux
  • 12+ years' experience in Cloud operations/administration, with proven experience in (Windows or Linux).
  • 7+ years' experience in a technical team lead, supervisory, or project management role.
  • 5+ years multi-cloud experience (must have hands-on in at least 2 of AWS/Azure/GCP/OCI).
  • Direct experience in managed services/NOC/SOC/MSP environments is a must.
  • Deep understanding of Windows Server OS (2016, 2019, 2022): Active Directory, Group Policy, DNS, DHCP, WSUS, etc.
  • Strong Linux administration skills: RHEL, CentOS, Ubuntu, systemd, SELinux, cron, etc.
  • Scripting and automation: PowerShell (for Windows), Bash (for Linux), and tools like Ansible or Puppet.
  • Virtualization technologies: VMware vSphere, Hyper-V, KVM.
  • Patch management and OS lifecycle management.
  • Performance tuning and system optimization for both Windows and Linux servers.
  • Monitoring and logging tools: Nagios, Zabbix, Prometheus, Grafana, Windows Event Viewer, etc.
  • Backup and disaster recovery strategies and tools (e.g., Veeam, Bacula, Commvault).
  • Security hardening and compliance (e.g., CIS benchmarks, STIGs).
  • Networking fundamentals: TCP/IP, routing, firewalls, load balancing.

Nice To Haves

  • Microsoft Certified: Windows Server Hybrid Administrator Associate
  • Microsoft Certified: Azure Administrator Associate
  • Red Hat Certified System Administrator (RHCSA)
  • Red Hat Certified Engineer (RHCE)

Responsibilities

  • Lead, mentor, and develop a team of cloud Windows/Linux engineers and administrators.
  • Allocate resources, manage schedules, and ensure adherence to SLAs and KPIs.
  • Conduct regular team meetings, performance reviews, and training sessions.
  • Provide oversight for the daily operations, incident management, change management, and problem resolution across customer cloud environments.
  • Supervise deployment, maintenance, and optimization of cloud infrastructure following best practices and security standards.
  • Ensure monitoring, backup, patching, and disaster recovery configurations are properly implemented.
  • Conduct root cause analyses for critical incidents and drive corrective/preventive actions.
  • Act as escalation point for customers during incidents or critical issues.
  • Participate in client meetings, provide technical guidance, and support cloud solution discussions.
  • Prepare and present operational reports, incident summaries, and optimization recommendations.
  • Oversee the development and implementation of automation tools/scripts for provisioning, configuration management, monitoring, and reporting.
  • Identify areas for operational improvement and cost optimization.
  • Ensure all managed environments are operated according to organizational policies, compliance frameworks, and industry best practices.
  • Collaborate with security teams for vulnerability management, cloud security compliance, and incident investigation.

Benefits

  • medical
  • dental
  • vision insurance
  • flexible spending or health savings account
  • life and AD&D insurance
  • short and long term disability coverage
  • paid time off
  • employee assistance
  • participation in a 401k program with company match
  • additional voluntary or legally-required benefits
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service