About The Position

We’re looking for an experienced Site Reliability Engineer III with a deep focus on networking to help design, build, and operate the critical network infrastructure that supports our global platforms. You’ll work across cloud and on‑prem environments, ensuring our systems are reliable, scalable, secure, and observable. This role blends hands‑on engineering with architectural thinking and cross‑team collaboration.

Requirements

  • 5–8 years of experience in SRE, network engineering, or infrastructure roles with production ownership.
  • Deep understanding of networking fundamentals including TCP/IP, DNS, BGP, OSPF, VLANs, and firewalls.
  • Hands‑on experience with cloud networking (AWS VPC, Transit Gateway, Azure VNets, GCP VPC).
  • Proficiency with network automation tools such as Ansible, Terraform, or vendor‑specific APIs.
  • Experience with load balancers and traffic management (NGINX, Envoy, cloud LB services).
  • Strong Linux systems knowledge and comfort troubleshooting at multiple layers.
  • Programming or scripting skills in Python, Go, or similar languages.
  • Experience with observability platforms such as Prometheus, Grafana, ELK, or OpenTelemetry.
  • Proven track record in incident management for network‑related outages.

Nice To Haves

  • Experience with SDN technologies (e.g., Cisco ACI, VMware NSX).
  • Knowledge of zero‑trust networking principles and modern security architectures.
  • Background with distributed systems networking (Kafka, microservices, multi-cluster).
  • Vendor certifications such as CCNP, CCIE, JNCIP, or cloud networking specialties.
  • Experience contributing to network automation tooling or open‑source projects.

Responsibilities

  • Design and maintain core network infrastructure across cloud, data center, and hybrid environments.
  • Implement and optimize network automation to reduce manual operations and improve consistency.
  • Develop and maintain network observability including metrics, logs, tracing, and alerting for network‑centric systems.
  • Lead incident response for network‑related issues and drive root‑cause analysis and long‑term remediation.
  • Collaborate with SRE, security, and platform teams to ensure network reliability aligns with service needs.
  • Manage and evolve load balancing, routing, and traffic management for high‑availability systems.
  • Define and enforce network reliability standards including SLOs, SLIs, and error budgets.
  • Harden network security through segmentation, policy enforcement, and best‑practice configurations.
  • Support capacity planning and performance tuning for network‑heavy workloads.
  • Mentor junior engineers and contribute to a culture of operational excellence.

Benefits

  • You may also be offered incentive compensation, bonus, restricted stock units, and benefits.
  • More details about F5’s benefits can be found at the following link: https://www.f5.com/company/careers/benefits.
  • F5 reserves the right to change or terminate any benefit plan without notice.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service