Principal Systems Engineer

ARM
Austin, TX (Hybrid)
$297,600 - $402,600

About The Position

We are seeking a highly skilled and motivated Systems Engineer with a specialized focus on network interconnects within blade and rack-level systems. This role is pivotal in designing, analyzing, and optimizing the physical and logical interconnection of compute, storage, and accelerator components in high-performance systems. The ideal candidate will possess deep expertise in the networking standards, topologies, and technologies used in datacenters, HPC systems, and cloud-scale infrastructure.

Requirements

  • Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, or a related field.
  • 5+ years of experience in systems or network engineering, preferably in datacenter or HPC environments.
  • Strong understanding of rack-level network topologies and high-speed interconnect technologies.
  • Hands-on experience with Ethernet and/or InfiniBand networking, including QSFP/OSFP modules and switching.
  • Familiarity with Layer 1-3 networking protocols and performance tuning.
  • Experience with simulation tools and methodologies for network performance analysis.

Nice To Haves

  • Knowledge of CXL, NVLink, or PCIe-based interconnects.
  • Experience with Open Compute Project (OCP) hardware and standards.
  • Experience in thermal and power modeling related to networking hardware.
  • Scripting skills (Python, Bash) for automation and analysis.

Responsibilities

  • Define and architect rack-level network topologies and related technologies (e.g., 200/400/800G Ethernet, SmartNICs, network security) for scalable system designs.
  • Collaborate with hardware and software engineering teams to design network interconnect solutions aligned with performance, power, and cost goals within the rack.
  • Evaluate and select network fabrics (e.g., Ethernet, InfiniBand, CXL, PCIe) based on system requirements.
  • Model and simulate network performance across rack-level systems.
  • Perform bottleneck analysis and propose optimizations for bandwidth, latency, and reliability.
  • Collaborate with platform and board teams to ensure seamless integration of interconnects at the physical and logical levels.
  • Define test plans and support bring-up and validation of networking subsystems.
  • Work with industry network vendors to ensure solutions meet the technical and feature requirements of AI/ML systems.
  • Maintain awareness of evolving standards (e.g., IEEE 802.3, OCP NIC, OSFP, QSFP-DD) and contribute to the development of internal guidelines.
  • Ensure systems meet regulatory and industry compliance requirements for networking hardware.
  • Partner with software, firmware, and datacenter infrastructure teams to ensure end-to-end functionality and compatibility.
  • Work with vendors and technology partners for roadmap alignment and issue resolution.
  • Work closely with ODM partners to develop integration and test requirements and to ensure proper validation coverage.

Benefits

  • Competitive compensation and benefits.
  • Opportunities for career growth and continuous learning.