Senior Manager - Networking Engineering

Core42 US Services LLC
Remote

About The Position

The Senior Manager of Network Engineering will lead the strategy, design, deployment, and operational excellence of network infrastructure supporting large-scale GPU, AI, and HPC environments. This role is responsible for building and managing high-performing network engineering teams while ensuring the reliability, scalability, security, and performance of multi-tenant and high-density network environments. The ideal candidate brings deep expertise in AI and HPC network architectures, including high-speed Ethernet, InfiniBand, RoCEv2, segmentation, routing, firewall policy, and multi-site connectivity. This leader will partner closely with infrastructure, security, facilities, operations, and executive leadership to deliver resilient network platforms that support rapid growth and demanding performance requirements.

Requirements

  • 10+ years of experience in network engineering, including large-scale data center, HPC, cloud, or AI environments.
  • 5+ years of leadership experience managing network engineering teams or programs.
  • Deep expertise in data center networking: BGP, EVPN/VXLAN, VRFs, VLANs, routing, switching, and segmentation.
  • Strong experience with InfiniBand, RoCEv2, and high-speed Ethernet in GPU or HPC environments.
  • Proven experience designing and operating secure multi-tenant network architectures.
  • Strong understanding of firewall platforms and network security controls in complex environments.
  • Experience working with ISPs, carriers, and vendors for provisioning and escalation management.
  • Hands-on experience with network monitoring, telemetry, packet capture, and troubleshooting tools.
  • Demonstrated ability to build, lead, and scale technical engineering teams.
  • Strong communication skills and ability to translate technical strategy into business impact.

Nice To Haves

  • Experience in GPUaaS, AI cloud, sovereign cloud, or large-scale HPC environments.
  • Knowledge of zero trust architecture and microsegmentation.
  • Familiarity with data center physical infrastructure (power, cooling) as it relates to networking.
  • Experience with infrastructure-as-code and network automation.
  • Certifications such as CCNP, CCIE, JNCIP, PCNSE, or equivalent.
  • Experience with multi-site or geographically distributed network environments.

Responsibilities

  • Develop and execute the network engineering strategy for large-scale AI, GPU, and HPC infrastructure, ensuring scalability, performance, and operational resilience.
  • Lead the design, implementation, and lifecycle management of high-performance network environments, including Ethernet, InfiniBand, RoCEv2, management, storage, and tenant-isolated network fabrics.
  • Oversee multi-tenant network architectures, including segmentation, VRFs, VLANs, routing domains, and secure traffic flow between environments.
  • Establish and enforce network security standards, including firewall policy design, access controls, and traffic inspection aligned with organizational and regulatory requirements.
  • Partner with infrastructure engineering, security, and operations teams to ensure network architecture integrates with compute, storage, and data center design.
  • Collaborate with ISPs, carriers, and hardware vendors for service delivery, escalation management, capacity planning, and performance optimization.
  • Drive network observability through monitoring, alerting, logging, and performance analysis across critical services and transport layers.
  • Lead incident response and root cause analysis for major network events, ensuring timely resolution and continuous improvement.
  • Build, mentor, and manage a high-performing network engineering team, setting priorities, development plans, and technical standards.
  • Define and maintain documentation standards for architecture, topology, policies, procedures, and change management.
  • Drive automation and standardization using tools such as Ansible, Python, or equivalent frameworks.
  • Establish and track KPIs for availability, latency, throughput, incident response, and network reliability.
  • Serve as the primary network engineering liaison to senior leadership, providing roadmap recommendations and risk assessments.
  • Evaluate emerging technologies to improve performance, security, and scalability across AI infrastructure environments.

Benefits

  • bonus
  • LTIP
  • benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service