IT Manager - Cloud Operations

ADTIrving, TX
Hybrid

About The Position

The Manager leads cloud‑native and on-prem infrastructure operations with a primary focus on Google Cloud Platform and secondary oversight of Azure, AWS, and OCI. This role is accountable for the availability, resiliency, cost efficiency, and operational excellence of enterprise platforms spanning hypervisors, Linux/Unix and Windows systems, Citrix/VDI, data protection, disaster recovery, and site replication. The position combines strong people leadership, incident command, and FinOps accountability, managing blended teams of FTEs, contractors, and MSPs while ensuring effective budgeting, forecasting, inventory, and license management. The role drives successful execution of infrastructure projects, outages, and transformation initiatives.

Requirements

  • Strong expertise in Google Cloud Platform (GCP) and hybrid infrastructure environments.
  • Deep understanding of cloud-native vs. traditional infrastructure (hypervisors, virtualization, etc.).
  • Experience managing multi-domain infrastructure teams (Linux, Windows, storage, backup, VDI).
  • Knowledge of infrastructure provisioning, automation, and operational tooling.
  • Familiarity with data protection, disaster recovery, and business continuity planning.
  • Experience with Citrix / Virtual Desktop Infrastructure (VDI) environments.
  • Strong leadership skills with the ability to manage and develop technical teams.
  • Excellent problem-solving and decision-making capabilities in high-availability environments.
  • Strong communication skills, with the ability to work cross-functionally with technical and business stakeholders.
  • Experience driving process improvement and operational maturity.
  • Understanding FinOps, P&L, Opex/Capex and ability to do budget planning and tracking.
  • Infrastructure observability tools as well as capacity planning and management.
  • Bachelor’s degree or equivalent experience in IT, Computer Science, or related field.
  • 8–12+ years of experience in cloud operations, infrastructure, or data center environments.
  • 3–5+ years of experience leading or managing technical teams.
  • Hands-on experience with GCP or other major cloud platforms (AWS/Azure transferable).
  • Experience managing hybrid environments (cloud + on-prem).
  • Strong foundation across compute, storage, and OS-level operations.
  • IT financials experience with forecasting, inventory and license tracking.

Nice To Haves

  • GCP certifications (Professional Cloud Architect, etc.).
  • Experience with FinOps / cloud cost optimization strategies.
  • Background in enterprise infrastructure or managed services environments.
  • Exposure to automation tools (Terraform, scripting, etc.).
  • Experience scaling teams during periods of rapid growth or transformation.

Responsibilities

  • Lead Cloud and Infrastructure Operations teams supporting GCP centric hybrid environments, with secondary responsibility for Azure, AWS, and OCI.
  • Provide people leadership for blended teams (FTEs, contractors, MSPs), including performance management, coaching, and workforce planning.
  • Serve as the incident and outage commander, leading war rooms, coordinating recovery, delivering executive communications, and driving root cause analysis.
  • Own FinOps responsibilities including budgeting, forecasting, cost optimization, inventory tracking, and license management across hybrid and multi cloud platforms.
  • Oversee operations for compute, hypervisors, Linux/Unix and Windows systems, Citrix/VDI, storage, data protection, backup, site replication, and disaster recovery.
  • Provide operational governance for SaaS platforms and integrations, including Salesforce and ServiceNow, in partnership with application owners.
  • Lead delivery of infrastructure projects, migrations, refreshes, and cloud transformation initiatives.
  • Drive automation and Infrastructure as Code adoption to improve reliability, speed, and operational efficiency.
  • Establish and enforce operational standards, monitoring, runbooks, and on call practices.
  • Act as a senior escalation point for complex technical, operational, and vendor related issues.
  • Ensure high levels of system availability, performance, and scalability
  • Develop and maintain documentation, runbooks, and operational playbooks
  • Assist in capacity planning and infrastructure roadmap development.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service