Senior Network Engineer

Together AISan Francisco, CA
24d

About The Position

As a Senior Network Engineer at Together, you are responsible for designing, implementing, and maintaining our network infrastructure to ensure seamless connectivity and optimal performance for all user-facing services and production systems. As both a strategic planner and a hands-on engineer, you apply sound networking principles, operational discipline, and advanced automation to our network environments. You specialize in networking systems—including routing, switching, network security, and protocols—implementing best practices for availability, reliability, and scalability. You have a keen interest in network design, optimization, and emerging technologies in HPC-based data center networking. Outstanding problem-solving abilities and a comprehensive understanding of fundamental network theory are also critical to your success.

Requirements

  • 8+ years of professional experience building, managing, and supporting large-scale hybrid data center networks (excluding enterprise networks).
  • High level of proficiency with TCP/IP networking architecture and technologies such as BGP, OSPF, VXLAN, EVPN, and QoS.
  • Experience developing network automation pipelines using Python, Ansible, or other languages/tools utilized in infrastructure automation.
  • Proficient in using tools such as Wireshark, tcpdump, nmap, MTR, and curl to identify connectivity issues, latency problems, and network bottlenecks.
  • Experience designing and supporting multi-tenant networks
  • Hands-on experience deploying and supporting network devices from Cisco, Arista, Juniper, and Mellanox.
  • Experience working with cloud networks such as AWS, GCP, and Azure.
  • Solid experience working in and troubleshooting within a Linux environment.

Nice To Haves

  • Knowledge of RoCE and Infiniband protocols a plus
  • Experience with Docker, Kubernetes, or Slurm a plus
  • Understanding of AI training workloads and the demands they exert on networks a plus

Responsibilities

  • Design, deploy, manage and maintain global multi-vendor, multi-protocol high performance compute networks.
  • Analyze data to diagnose and identify root causes to network issues to minimize downtime
  • Evaluate and recommend network technologies, hardware, and software solutions.
  • Participate in design reviews to ensure the proposed network architecture aligns with business needs and is optimized for performance, scalability, and reliability.
  • Manage relationships with external vendors and partners to test and verify hardware and software selections.
  • Develop, and deploy systems and tools to keep all networks running reliably and efficiently
  • Establish and implement industry best practices and contribute to the design of new scalable network solutions
  • Ensure compliance with IT governance standards and best practices.
  • Lead projects to address complex technical challenges, directly contributing to roadmaps and partner alongside the best engineers in the industry to develop world-class solutions

Benefits

  • We offer competitive compensation, startup equity, health insurance and other competitive benefits.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

101-250 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service