Apple Inc.-posted 15 days ago
Full-time • Mid Level
Sunnyvale, CA
5,001-10,000 employees
Computer and Electronic Product Manufacturing

As a Network Systems Integration Engineer, you will build and maintain the critical infrastructure that enables our hardware innovation. Your primary mission is to provide robust, hands-on support for our data center development labs, ensuring our electrical, validation, and cross-functional hardware engineering teams have a reliable and performant environment. This requires a strong foundation in systems integration and network architecture, applied through daily tasks such as configuring switches, provisioning systems, and automating operational workflows with custom scripts. Come join us! Leveraging your deep expertise in server platform integration and high-performance networking, you will architect, deploy, and sustain the complex lab infrastructure essential for validating next-generation compute platforms. Your responsibilities will include the end-to-end management of high-bandwidth fabrics, lossless interconnects, and out-of-band networks. In this highly collaborative role, you will partner with cross-functional organizations-including Hardware Engineering, Operations, Security, and Systems Validation-to guarantee the readiness, performance, and reliability of our dynamic Ru0026D environment.

  • M.S. Degree with minimum 3-5 years experience.
  • Proven experience designing, configuring, and validating RoCEv2 (RDMA over Converged Ethernet) fabrics. This must include hands-on work with Priority Flow Control (PFC) and Explicit Congestion Notification (ECN) on data center switches.
  • Deep expertise in configuring and tuning the Linux network stack for high-performance workloads, including direct experience with RDMA drivers (e.g., NVIDIA OFED), kernel bypass methods, and optimizing network performance through CPU affinity and interrupt handling.
  • Demonstrated ability to root-cause performance bottlenecks across the entire stack. This includes using tools to measure and analyze application-level throughput, RDMA-level latency and correlating it with system metrics (CPU, memory) and network-level statistics (packet drops, PFC pause frames).
  • Experience troubleshooting elusive integration issues that manifest as network problems, but originate in server hardware or firmware. This includes debugging interactions between BIOS/UEFI settings, PCIe lane allocation, NIC firmware, and OS driver behavior.
  • Experience using Ansible or Python to configure multi-vendor switch fabrics, deploy custom OS images via PXE, and run automated test suites to validate fabric health and performance.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service