Digital Technology Server Engineer Lead

Oshkosh Corporation, Menasha, WI

About The Position

At Oshkosh, we build, serve and protect people and communities around the world by designing and manufacturing some of the toughest specialty trucks and access equipment. We employ over 18,000 team members, all united by a common purpose. Our engineering and product innovation help keep soldiers and firefighters safe, are critical in building communities and keeping them clean, and help people do their jobs every day.

Server Engineer - Lead (Enterprise Compute and Virtualization)

Enterprise infrastructure role focused on Intel-based compute, virtualization operations, automation, and adjacent technology integration.

The Server Engineer - Lead is a senior technical contributor responsible for architecting, operating, and optimizing Oshkosh Corporation's enterprise compute and virtualization ecosystem. This role ensures the security, stability, scalability, and lifecycle management of a large fleet of Intel-based server hardware and virtual infrastructure built on VMware and other hypervisors. That infrastructure supports critical manufacturing, ERP, engineering, and enterprise business applications across data center and hybrid cloud environments.

The successful candidate will combine deep expertise in enterprise server hardware, virtualization, automation, and infrastructure operations to drive operational excellence, standardization, resiliency, and modernization. They will be accountable for the full compute platform stack, from hardware strategy and firmware governance to virtualization design, performance optimization, capacity planning, and operational support, while advancing automation, monitoring, and platform engineering practices that improve reliability and efficiency at scale.

YOUR IMPACT

The Server Engineer - Lead is essential to ensuring the stability, performance, and evolution of Oshkosh's enterprise compute and virtualization ecosystem. By combining deep server expertise with strong knowledge of VMware, VMware Aria Operations, and other hypervisors, along with operational leadership and automation discipline, this role drives reliability, standardization, and scalability across one of the company's most critical infrastructure foundations. This position is a cornerstone of modern infrastructure operations, balancing day-to-day platform resilience with the engineering rigor required to support future growth, modernization, and digital transformation. The role requires an advanced understanding of multiple server technologies and environments in order to take ownership of most aspects of the platform from end to end.

Requirements

  • Five (5) or more years of experience in the field or in a related area.
  • Skills in monitoring, troubleshooting, customer service, problem solving, cross-team collaboration, risk analysis, analytical thinking, operating systems, hardware, infrastructure design, and scripting.
  • Strong communication, time management, problem solving, teamwork, leadership, mentoring, project management, business acumen, requirements gathering, and planning skills.

Nice To Haves

  • Experience: 7+ years administering and architecting enterprise server infrastructure and virtualization environments at scale.
  • Technical Expertise: Deep understanding of Intel-based enterprise server hardware, VMware vSphere, ESXi, vCenter, VMware Aria Operations, and comparable virtualization platforms, including clustering, virtualization performance tuning, and infrastructure lifecycle management.
  • Platform Operations: Strong experience managing large virtualized environments supporting mission-critical enterprise applications in a highly available and regulated setting.
  • Automation: Proficiency with Ansible, Terraform, PowerCLI, and scripting languages such as PowerShell or Python for infrastructure automation and operational efficiency.
  • Hardware Lifecycle Management: Experience with firmware baselines, hardware compatibility, server provisioning, vendor interoperability, and compute platform refresh strategy.
  • Ecosystem Knowledge: Strong understanding of integration points across compute, storage, networking, backup, disaster recovery, identity, monitoring, and container platforms.
  • Container Familiarity: Familiarity with container management platforms and how virtual infrastructure supports solutions such as OpenShift, Kubernetes, or similar enterprise container ecosystems.
  • Public Cloud Familiarity: Working familiarity with public cloud compute services and adjacent infrastructure patterns in Azure, AWS, or similar environments, with understanding of how they complement enterprise data center operations.
  • Monitoring and Reliability: Experience with enterprise monitoring, alerting, and operational visibility platforms used to manage compute and virtualization health and performance, including VMware Aria Operations or similar platforms.
  • Leadership: Demonstrated ability to lead complex infrastructure initiatives, mentor technical staff, and drive cross-functional operational improvements.
  • Soft Skills: Strong analytical, communication, and problem-solving skills with a focus on platform stability, scalability, and continuous improvement.

Responsibilities

  • Enterprise Compute and Virtualization Platform Management: Lead the architecture, implementation, lifecycle management, and operational support of Oshkosh's large-scale enterprise compute environment, including Intel-based server platforms and virtualization platforms based on VMware and other hypervisors.
  • Manage server hardware standards, platform design, firmware strategy, host lifecycle management, and infrastructure refresh planning across the enterprise.
  • Oversee the health, performance, availability, and capacity of virtualization environments, including hypervisor hosts, management platforms, clusters, distributed resource scheduling, and high availability configurations.
  • Design and maintain resilient compute and virtualization solutions that support mission-critical enterprise workloads with strong emphasis on uptime, recoverability, and operational consistency.
  • Partner with storage, networking, backup, security, and application teams to ensure end-to-end performance and reliability of hosted workloads.
  • Establish and maintain operational standards for provisioning, patching, upgrades, host remediation, and configuration consistency across the compute estate.
  • Infrastructure Operations and Platform Reliability: Serve as a senior escalation point for complex server hardware and virtualization incidents, leading root-cause analysis and restoration efforts for critical platform issues.
  • Drive proactive management of system health, resource utilization, hardware events, performance bottlenecks, and operational risks across the compute environment.
  • Lead capacity planning and performance optimization efforts for compute, memory, clustering, and virtualization resources to support current and future business demand.
  • Oversee platform resilience through the design and support of high-availability, fault-tolerant infrastructure services aligned with disaster recovery requirements.
  • Ensure enterprise operational readiness for maintenance events, lifecycle transitions, incident response, and business continuity requirements.
  • Automation and Infrastructure-as-Code: Architect and maintain automated workflows using tools such as Ansible, Terraform, PowerCLI, and scripting languages such as PowerShell or Python for provisioning, configuration management, patching, and compliance activities.
  • Build and enhance automation for hypervisor host deployment, cluster configuration, lifecycle management, and policy enforcement.
  • Codify operational procedures and infrastructure standards into repeatable, auditable automation workflows to reduce manual effort and improve consistency.
  • Support infrastructure change through automated validation, testing, and deployment processes that improve quality and reduce risk.
  • Virtualization Ecosystem and Modern Platform Integration: Lead engineering and administration of virtualization platforms, including core hypervisor services, cluster design, host profiles, virtual networking coordination, and integration with enterprise storage and backup platforms.
  • Maintain deep expertise in VMware, as well as other hypervisors and technologies that make up Oshkosh's enterprise compute portfolio.
  • Maintain familiarity with adjacent technologies supporting the virtual infrastructure ecosystem, including hyperconverged platforms, disaster recovery tooling, monitoring systems, VMware Aria Operations, container platforms, and hybrid cloud extensions where applicable.
  • Maintain familiarity with public cloud compute services and how enterprise workloads, recovery strategies, and management practices may extend into Azure, AWS, or similar environments as part of a broader infrastructure portfolio.
  • Collaborate with platform, cloud, and application teams to support evolving infrastructure patterns, including integration with private cloud and container-hosting platforms where virtual infrastructure is foundational.
  • Provide technical leadership on modernization opportunities that improve efficiency, scalability, recoverability, and operational simplicity across the enterprise compute platform.
  • Monitoring, Visibility, and Operational Insight: Maintain and enhance platform visibility through enterprise monitoring, alerting, and performance analytics tools, including VMware Aria Operations, Grafana, and related ecosystem tooling, to support rapid issue detection and response.
  • Establish dashboards, alert thresholds, and operational reporting for server hardware health, virtualization performance, resource consumption, capacity trends, and availability.
  • Use telemetry, trend analysis, and platform insights to inform capacity decisions, lifecycle planning, and service improvement initiatives.
  • Partner with enterprise monitoring and operations teams to improve actionable insight across the compute and virtualization landscape.
  • Collaboration and Leadership: Serve as a subject matter expert and technical leader for enterprise server hardware and virtualization technologies.
  • Mentor junior engineers and help establish best practices for compute operations, virtualization engineering, automation, lifecycle management, and operational monitoring.
  • Collaborate cross-functionally with infrastructure, security, architecture, and application stakeholders to align platform capabilities with business priorities.
  • Contribute to strategic planning, roadmaps, standards development, and investment recommendations for enterprise compute and virtualization services.
  • Other duties as assigned.
  • Regular attendance is required.

What This Job Offers

Job Type: Full-time
Career Level: Mid Level
Education Level: No Education Listed
Number of Employees: 5,001-10,000 employees
