Principal Network Software and Solution Engineer - Switch Solutions (27649)

Super Micro Computer, Inc. | San Jose, CA
Posted 53d ago | Onsite

About The Position

Supermicro is seeking an experienced AI Network Software Solution Architect to lead the design and development of next-generation network infrastructure solutions optimized for AI workloads. This role requires deep expertise in GPU fabric design, high-speed switching, network automation, and network observability at scale. You will architect and develop robust, scalable network solutions in collaboration with external solution providers, VARs, and Supermicro internal teams. You will also define strategy roadmaps and ensure our networking infrastructure is ready for the most demanding AI platforms. This role is based at our headquarters in San Jose, CA.

Requirements

  • 15-20 years of experience in network engineering or architecture roles, including large-scale data center or AI infrastructure environments
  • Bachelor's degree in Computer Science, Electrical Engineering, or equivalent experience
  • Strong business acumen: able to balance performance, cost, and scalability in architecture decisions.
  • Customer-Focused Mindset: Experience working closely with customers to design solutions that meet their unique needs and to resolve complex technical challenges.
  • Strong Communication & Leadership: Exceptional communication skills, both written and verbal, with an ability to manage relationships, negotiate effectively, and work with high-level executives.
  • Strong hands-on experience with Open Networking switching platforms & SONiC.
  • Proven track record designing data center fabrics using BGP, OSPF, EVPN-VXLAN, and overlay networks
  • Expertise with InfiniBand, RoCEv2, and RDMA-based networking in GPU environments
  • Proficient in network automation using Ansible, Terraform, Python, and Git-based workflows
  • Ability to define business-aligned network strategy roadmaps for scalable AI infrastructure
  • Experience leading HLD/LLD design efforts and technical documentation
  • Strong understanding of telemetry, observability, and proactive network health management

Responsibilities

  • Design & Develop Cutting-Edge Solutions: Develop network solutions in collaboration with external solution providers, VARs, and Supermicro internal teams.
  • AI-Optimized Network Architecture: Design low-latency, high-throughput AI network fabrics (scale-out, scale-up, converged) to support GPU traffic patterns for training and distributed inferencing.
  • Fabric Design & Topology: Architect rail-optimized and Clos-based multiplane leaf-spine topologies using 100G/400G/800G infrastructure across various networking platforms.
  • Control Plane & Protocol Integration: Create multitenant BGP, EVPN, VXLAN, and routing designs for both scale-out internal cluster traffic and external ingress/egress paths to the internet and cloud.
  • Define and Drive Strategy: Define and drive networking strategy aligned with business growth, automation goals, and AI infrastructure scalability.
  • Network Automation & Orchestration: Develop infrastructure-as-code workflows using Ansible, Terraform, and Python to automate provisioning, configuration, and monitoring.
  • System Performance & Observability: Implement telemetry pipelines and traffic analytics for proactive visibility, capacity planning, and SLA adherence.
  • HLD/LLD Documentation & Standards: Develop high-level and low-level network solution design documentation, playbooks, and operational standards to support scalable deployments and troubleshooting.
  • Technology & Market Evaluation: Evaluate emerging technologies from NVIDIA, AMD, hyperscalers, and connectivity providers to influence roadmap decisions.
  • Cross-Functional Collaboration & Leadership: Work closely with platform, hardware, facilities, and security teams to deliver integrated network solutions and infrastructure for AI/ML workloads.

What This Job Offers

Job Type: Full-time
Career Level: Mid Level
Industry: Computer and Electronic Product Manufacturing
Number of Employees: 5,001-10,000 employees
