Staff Software Engineer, Network Automation

CrusoeSunnyvale, CA
$215,000 - $260,000Onsite

About The Position

Crusoe is seeking a Staff Network Automation Engineer to design and deliver the automation frameworks, observability platforms, and self-healing workflows that keep our global network fleet running at scale. This role sits at the intersection of Network Engineering, Software Engineering, and Infrastructure Reliability — and is central to Crusoe's mission of building AI infrastructure that scales intelligently, not just bigger. This is a full-time role for someone who wants to work on hard problems at hyperscale.

Requirements

  • 8+ years of network engineering experience with a demonstrated focus on production network automation and infrastructure as code
  • Production-quality software engineering skills in Python or Go, with CI/CD integration and platform-level thinking
  • Hands-on experience with model-driven automation including gNMI, OpenConfig, NETCONF, and YANG
  • Experience contributing to or owning a network source of truth platform such as NetBox or Nautobot
  • Strong knowledge of Arista (EOS) and/or Juniper (Junos) in leaf-spine DC fabric environments
  • Solid understanding of BGP, EVPN-VXLAN, and LLDP at data center scale
  • Experience building or contributing to observability platforms using Prometheus, Grafana, and streaming telemetry tooling

Nice To Haves

  • Experience with NVIDIA/Mellanox platforms in production environments
  • Familiarity operating at fleet scale across thousands of network devices in multi-region environments
  • Exposure to closed-loop, event-driven automation and auto-remediation systems
  • Experience in hyperscale or internet-scale infrastructure (cloud providers, large CDNs, or AI/ML infrastructure companies)

Responsibilities

  • Contribute to the technical roadmap for Crusoe's automation stack, from source of truth and config generation through day-2 operations and closed-loop remediation across our global fleet.
  • Help design and maintain the authoritative data model (NetBox, Nautobot, or equivalent) that drives network configuration, validation, and operational state across teams.
  • Build and maintain declarative, model-driven configuration systems using Python, Nornir, Ansible, and Jinja2, treating the network as code and eliminating configuration drift.
  • Contribute to Crusoe's gNMI, OpenConfig, and NETCONF/YANG strategy for telemetry collection, configuration management, and state validation across multi-vendor fabrics.
  • Build and maintain event-driven auto-remediation systems that detect faults, correlate telemetry, and resolve known failure modes without human escalation.
  • Help build and improve Crusoe's telemetry, metrics, alerting, and dashboarding stack including Prometheus, Grafana, and streaming gNMI collectors.
  • Work closely with Network Architecture to ensure designs are automation-first — deployable, validatable, and operable programmatically at scale.

Benefits

  • Competitive compensation and equity packages
  • Restricted Stock Units
  • Paid time off, paid holidays & leave of absence programs
  • Comprehensive health, dental & vision insurance
  • Employer contributions to HSA account
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) Retirement plan with company match up to 4% of salary
  • Volunteer time off
  • Global travel insurance & emergency assistance
  • Daily meals allowance
  • Additional perks & programs specific to location
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service