Supervisor, Server & Storage

ProenergyBuenos Aires, TX
1dHybrid

About The Position

The Supervisor, Server & Storage is a senior technical leader responsible for architecting, implementing, and managing enterprise-scale virtualization, storage, and cloud infrastructure platforms across ProEnergy's global operations. This role demands expert-level proficiency in hypervisor technologies, hyperconverged infrastructure and hybrid management solutions, and storage solution. The position requires advanced automation and scripting capabilities using PowerShell, Python, Terraform, and Infrastructure as Code (IaC) methodologies to drive operational efficiency and infrastructure modernization at scale. This strategic leadership role includes managing geographically distributed teams across multiple regions, overseeing 24/7 infrastructure operations supporting both IT and OT environments, and ensuring 99.9%+ availability for mission-critical systems spanning power generation facilities, manufacturing operations, and enterprise business systems internationally. The ideal candidate brings 15+ years of deep technical expertise combined with proven experience leading large, global infrastructure teams through complex modernization initiatives including cloud migration, hybrid infrastructure deployment, and advanced automation implementation.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, Engineering, or related technical discipline (Master's degree preferred)
  • Minimum 12-15 years of progressive experience in enterprise infrastructure architecture and administration
  • Minimum 7-10 years of direct leadership experience managing technical teams, with at least 5 years managing geographically distributed global teams
  • Proven track record managing large-scale infrastructure supporting 5,000+ users and 2,000+ virtual machines across multiple geographic regions
  • Demonstrated experience leading teams of 6+ infrastructure professionals across multiple time zones and cultural environments
  • Experience in mission-critical environments such as power generation, utilities, manufacturing, financial services, or healthcare
  • Travel up to 25% (global sites, vendor engagements, team meetings)
  • Expert-level proficiency with VMware vSphere 7.x/8.x including vCenter Server, ESXi, DRS, HA, vMotion, Storage vMotion, Distributed Virtual Switches, and NSX-T integration
  • Advanced expertise in Microsoft Hyper-V and System Center Virtual Machine Manager (SCVMM) for enterprise virtualization management
  • Deep experience with hyperconverged infrastructure platforms: Azure Stack HCI, Nutanix AOS/AHV, Dell VxRail, or HPE SimpliVity
  • Comprehensive knowledge of Azure Stack HCI (Azure Local) architecture, deployment, and integration with Azure cloud services
  • Expert-level experience with Azure Arc including Arc-enabled servers, Arc-enabled Kubernetes, and hybrid infrastructure management
  • Advanced PowerShell scripting capabilities including module development, REST API integration, error handling, and pipeline automation
  • Proficiency with Python for infrastructure automation, data analysis, and API integrations with cloud platforms and management tools
  • Strong experience with Infrastructure as Code (IaC) using Terraform, ARM templates, Bicep, or CloudFormation for multi-cloud deployments
  • Advanced knowledge of Windows Server 2016/2019/2022 including Active Directory, Group Policy, clustering, and advanced networking features
  • Strong Linux administration skills including RHEL, Ubuntu, SUSE with expertise in shell scripting, systemd, and enterprise management
  • Comprehensive experience with enterprise storage systems including SAN (FC, iSCSI), NAS, software-defined storage, and backup solutions
  • Expert knowledge of Microsoft Azure including IaaS, PaaS, networking, security, governance, and cost optimization strategies
  • Working knowledge of AWS or Google Cloud Platform for multi-cloud strategy and workload portability
  • Experience with container technologies including Docker, Kubernetes, Azure Kubernetes Service (AKS), and container orchestration
  • Proficiency with configuration management tools: Ansible, Chef, Puppet, or Azure Automation DSC
  • Advanced monitoring and observability solutions: Azure Monitor, vRealize Operations, Datadog, Splunk, or Prometheus/Grafana
  • Understanding of CI/CD pipelines using Azure DevOps, GitHub Actions, GitLab CI, or Jenkins for infrastructure automation
  • US work authorization is a precondition of employment. The company will not consider candidates who require sponsorship for a work-authorized visa.
  • Successful candidate will need to satisfactorily complete pre-employment drug screen and background
  • Proven ability to lead, inspire, and develop high-performing technical teams across multiple geographic regions and cultures
  • Experience managing 24/7 global operations with follow-the-sun support models and coordinated time zone coverage
  • Strong project management skills with experience leading complex, multi-million dollar infrastructure transformation initiatives
  • Exceptional communication skills with ability to present technical concepts to executive leadership, business stakeholders, and technical audiences
  • Demonstrated experience with budget management, financial planning, and cost optimization for infrastructure operations
  • Strategic thinking capabilities with ability to align technology initiatives with business objectives and drive organizational change
  • Strong vendor management skills including contract negotiation, performance management, and strategic partnership development
  • Experience with change management, organizational transformation, and driving cultural adoption of new technologies and processes
  • Ability to work effectively under pressure, manage competing priorities, and make sound technical decisions in time-critical situations
  • Commitment to fostering inclusive, diverse teams and creating positive work environments across cultural and geographic boundaries
  • Deep understanding of operational technology (OT) environments, industrial control systems, and IT/OT convergence challenges
  • Knowledge of NERC-CIP Critical Infrastructure Protection requirements for power generation facilities
  • Familiarity with SOX compliance requirements for financial systems and audit controls
  • Understanding of ISO 27001, NIST Cybersecurity Framework, and enterprise security best practices
  • Experience with high availability requirements, disaster recovery planning, and business continuity management
  • Knowledge of GDPR, data sovereignty, and international data protection regulations for global operations

Nice To Haves

  • Experience with VMware Cloud Foundation (VCF) for software-defined data center deployments
  • Knowledge of Azure VMware Solution (AVS) for seamless workload migration and hybrid cloud integration
  • Familiarity with service mesh technologies (Istio, Linkerd) and advanced Kubernetes networking
  • Experience with NVMe over Fabrics (NVMe-oF) and next-generation storage protocols
  • Understanding of AI/ML infrastructure requirements including GPU virtualization and accelerated computing platforms
  • Knowledge of FinOps best practices, cloud cost allocation, and chargeback/showback models
  • Experience with infrastructure observability and AIOps platforms for predictive analytics and intelligent automation
  • Familiarity with edge computing architectures and distributed infrastructure management
  • Experience in power generation, oil & gas, or utility industries with understanding of SCADA and industrial control systems
  • Knowledge of lean/agile methodologies, DevOps practices, and site reliability engineering (SRE) principles
  • Experience with database administration including SQL Server, PostgreSQL, MongoDB, and database infrastructure optimization
  • Multilingual capabilities supporting global team communication and stakeholder engagement
  • VMware Certified Professional - Data Center Virtualization (VCP-DCV) or higher
  • Microsoft Certified: Azure Administrator Associate or Azure Solutions Architect Expert
  • Microsoft Certified: Windows Server Hybrid Administrator Associate
  • HashiCorp Certified: Terraform Associate or equivalent IaC certification
  • ITIL 4 Foundation (minimum) with preference for ITIL Managing Professional or Strategic Leader
  • One or more storage certifications: CompTIA Storage+, Dell EMC Storage Administrator, NetApp Certified Data Administrator, or equivalent
  • Preferred VMware Certified Advanced Professional (VCAP) or VMware Certified Design Expert (VCDX)
  • Microsoft Certified: Azure Solutions Architect Expert or Azure DevOps Engineer Expert
  • Nutanix Certified Professional (NCP) or Nutanix Certified Master (NCM) for hyperconverged infrastructure
  • AWS Certified Solutions Architect Professional or Google Cloud Professional Cloud Architect
  • Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)
  • Red Hat Certified Engineer (RHCE) or Red Hat Certified Architect (RHCA)
  • CISSP, CISM, or other advanced security certifications
  • Project Management Professional (PMP) or Certified ScrumMaster (CSM)
  • Vendor-specific certifications: Dell EMC, HPE, Cisco, Pure Storage, or NetApp advanced credentials

Responsibilities

  • Architect and manage enterprise-scale VMware vSphere environments including vCenter Server, ESXi clusters, Distributed Resource Scheduler (DRS), High Availability (HA), vMotion, Storage vMotion, and Distributed Virtual Switches (DVS)
  • Deploy and optimize Enterprise Cloud platforms
  • Implement advanced virtualization features including nested virtualization, virtual GPU (vGPU) configurations, SR-IOV networking, and NUMA optimization for performance-critical workloads
  • Design multi-tenant virtualization environments with resource pools, quotas, reservations, and shares ensuring fair resource allocation and isolation
  • Manage VMware vSphere for containerized workloads and modern application platforms
  • Oversee disaster recovery solutions including VMware Site Recovery Manager (SRM) for business continuity
  • Architect and deploy Azure Stack HCI (Azure Local) solutions integrating on-premises infrastructure with Azure cloud services, Azure Portal management, and Azure Arc connectivity
  • Implement HCI platforms with integrated compute, storage, and networking utilizing distributed storage fabric, data locality, and intelligent data tiering
  • Deploy hyperconverged solutions with automated lifecycle management and seamless scalability
  • Configure Storage with fault domains, storage QoS, deduplication, and compression optimization
  • Manage HCI cluster networking including Software Defined Networking (SDN), Network Controller, load balancing, and microsegmentation
  • Implement stretched HCI clusters for metro-area disaster recovery and continuous availability across data center sites
  • Integrate HCI with Azure Backup, Azure Site Recovery, and Azure Monitor for hybrid cloud management and protection
  • Deploy and manage Azure Arc-enabled servers for unified management of Windows and Linux systems across on-premises, multi-cloud, and edge locations
  • Implement Azure Arc-enabled Kubernetes for centralized governance, policy enforcement, and GitOps-based deployment across hybrid Kubernetes clusters
  • Configure Azure Arc-enabled data services including SQL Managed Instance and PostgreSQL Hyperscale for cloud-native data platforms on-premises
  • Leverage Azure Policy, Azure RBAC, and Azure Security Center integration through Arc for consistent security and compliance across hybrid environments
  • Implement Azure Monitor, Azure Sentinel, and Log Analytics integration via Arc for unified observability and security monitoring
  • Manage Azure Arc resource bridge deployments for extending Azure management plane to VMware vSphere and System Center Virtual Machine Manager environments
  • Orchestrate hybrid workload deployments using Azure Resource Manager (ARM) templates, Bicep, and Terraform across Arc-connected infrastructure
  • Develop sophisticated PowerShell scripts and modules for automated infrastructure provisioning, configuration management, and operational tasks across Windows and hybrid environments
  • Create Python automation solutions integrating with REST APIs, VMware PowerCLI, Azure SDKs, and infrastructure management platforms
  • Implement Infrastructure as Code (IaC) using HashiCorp Terraform for multi-cloud infrastructure deployment, Azure ARM templates, and Bicep for Azure resource provisioning
  • Deploy Ansible, Chef, or Puppet for configuration management and automated compliance enforcement across server fleets
  • Build CI/CD pipelines using Azure DevOps, GitHub Actions, or GitLab CI for infrastructure deployment automation and testing
  • Develop custom automation solutions for capacity management, performance optimization, patch orchestration, and compliance reporting
  • Implement Azure Automation, Azure Functions, and Logic Apps for cloud-native automation and event-driven orchestration
  • Create self-service portals and automated workflows reducing manual intervention and accelerating infrastructure delivery times by 70%+
  • Architect and manage enterprise SAN environments including Fiber Channel, iSCSI, and FCoE protocols with multipathing (MPIO) and load balancing
  • Deploy and optimize NAS solutions with NFS, SMB/CIFS protocols supporting structured and unstructured data workloads
  • Implement software-defined storage platforms including VMware vSAN, Microsoft Storage Spaces Direct, Nutanix Distributed Storage Fabric, and Ceph
  • Manage object storage solutions (Azure Blob Storage, AWS S3, MinIO) for cloud-native applications and data lake architectures
  • Configure storage tiering strategies with NVMe flash, SSD, and nearline SAS/SATA drives optimizing performance and cost efficiency
  • Implement data protection features including snapshots, replication, deduplication, compression, and encryption at rest/in transit
  • Manage backup infrastructure using Commvault or Azure Backup with immutable storage and ransomware protection
  • Design and implement disaster recovery solutions with RPO/RTO objectives meeting business continuity requirements
  • Lead and mentor a geographically distributed teams of infrastructure professionals across North America, South America, Europe, and Asia-Pacific regions
  • Manage 24/7 follow-the-sun support models with coordinated handoffs across time zones ensuring continuous operational coverage
  • Conduct performance management including goal setting, quarterly reviews, competency assessments, and career development planning for direct reports
  • Recruit, hire, and onboard senior technical talent globally, building high-performing teams with diverse skill sets and cultural perspectives
  • Develop technical training programs, certification roadmaps, and knowledge transfer initiatives ensuring team capabilities match evolving technology requirements
  • Foster culture of innovation, continuous improvement, and operational excellence through regular team collaboration, retrospectives, and best practice sharing
  • Coordinate cross-functional collaboration with network, security, application, and OT teams for integrated infrastructure solutions
  • Manage employee relations, conflict resolution, and performance improvement plans maintaining positive team dynamics across cultural boundaries
  • Implement succession planning strategies identifying and developing future technical leaders within the organization
  • Drive multi-year infrastructure roadmap development aligning technology initiatives with business objectives and operational requirements
  • Lead infrastructure modernization programs including legacy system migrations, technology refresh cycles, and cloud adoption strategies
  • Establish and maintain infrastructure standards, architectural principles, and design patterns ensuring consistency across global deployments
  • Develop and enforce governance frameworks for cloud resource management, security policies, and compliance requirements
  • Coordinate with enterprise architecture teams on technology evaluations, proof-of-concepts, and strategic technology decisions
  • Implement ITIL 4 service management practices including incident, problem, change, and capacity management processes
  • Lead vendor selection processes, contract negotiations, and strategic partnerships for hardware, software, and professional services
  • Develop comprehensive technical documentation including architecture diagrams, runbooks, standard operating procedures, and disaster recovery plans
  • Define and monitor infrastructure KPIs including availability (99.9%+ for critical systems), performance metrics, capacity utilization, and service quality indicators
  • Conduct capacity planning and forecasting for compute, storage, and network resources supporting 3-5 year growth projections
  • Lead root cause analysis for major incidents, implement corrective actions, and drive continuous improvement initiatives
  • Manage an infrastructure budget annually including capital expenditures, operational expenses, and cloud consumption costs
  • Implement FinOps practices for cloud cost optimization including right-sizing, reserved instances, spot instances, and resource tagging strategies
  • Develop business cases and ROI analyses for infrastructure investments demonstrating value delivery and strategic alignment
  • Negotiate enterprise agreements, volume licensing, and support contracts achieving 20-30% cost reductions through strategic procurement
  • Track and report on project expenditures, variance analysis, and financial performance against approved budgets
  • Implement security hardening standards for hypervisors, operating systems, and infrastructure components following CIS benchmarks and industry best practices
  • Coordinate with cybersecurity teams on vulnerability management, patch orchestration, and security incident response for infrastructure systems
  • Ensure compliance with NERC-CIP, SOX, NIST CSF, ISO 27001, GDPR, and industry-specific regulatory requirements across global infrastructure
  • Implement privileged access management (PAM), just-in-time access, and audit logging for infrastructure administration
  • Design and test business continuity plans including disaster recovery exercises, failover testing, and recovery validation
  • Deploy infrastructure monitoring solutions using Azure Monitor, VMware vRealize Operations, Datadog, or Splunk for proactive alerting and analytics
  • Manage change advisory board processes ensuring controlled implementation of infrastructure modifications with minimal business impact
  • Conduct regular infrastructure health assessments, performance reviews, and optimization initiatives maintaining peak operational efficiency

Benefits

  • competitive pay
  • excellent benefits that include Medical, Dental, Vision, and Life/Disability Insurance at minimal cost to the employee
  • 10 paid holidays
  • paid time off
  • 401K plan

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

501-1,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service