Senior Staff Network Engineer, Automation

CrusoeSan Francisco, CA
$245,000 - $295,000

About The Position

Crusoe is seeking a Senior Staff Network Automation Engineer to own how our network monitors, configures, and heals itself without human intervention. You will design and ship the automation frameworks, self-healing workflows, and observability platforms that allow our global network fleet to scale sublinearly with headcount. This is a senior technical leadership role at the intersection of Network Engineering, Software Engineering, and Infrastructure Reliability. You will partner with Network Architecture to translate intent-based designs into production automation, mentor a team of Staff and Senior engineers, and represent the automation org in cross-functional planning with Deployment, Operations, and Site Reliability. You're not writing scripts to replace manual tasks. You're building the control plane that makes the network intelligent.

Requirements

  • 12+ years of network engineering experience with a demonstrated focus on production network automation, platform engineering, and infrastructure as code in hyperscale or internet-scale environments.
  • Demonstrated Technical Leadership: Proven track record of designing and shipping network automation platforms used by a broader engineering organization, not scripts, but systems others build on.
  • Production-Quality Software Engineering: Mastery of Python or Go at a platform level, testable, CI/CD-integrated, and production-owned. You think like a software engineer who deeply understands networking.
  • Model-Driven Automation Fluency: Deep hands-on experience with gNMI, OpenConfig, NETCONF, and YANG-modeled configuration and telemetry. You understand why model-driven automation is the only path to hyperscale.
  • Source of Truth Ownership: You have designed or owned a network source of truth platform (NetBox, Nautobot, or equivalent) end to end, including the data model, integrations, and CI/CD pipelines that consume it.
  • Network Domain Depth: Strong hands-on expertise with Arista (EOS), Juniper (Junos), and NVIDIA/Mellanox platforms in leaf-spine architectures. Solid knowledge of BGP, EVPN-VXLAN, and LLDP at DC fabric scale.
  • Event-Driven and Self-Healing Systems: Track record of building auto-remediation and closed-loop automation that detects, correlates, and resolves faults without human intervention.
  • Observability Expertise: Hands-on experience building streaming telemetry and observability platforms using gNMI collectors, Prometheus, Grafana, and equivalent tooling at fleet scale.
  • Hyperscale Operational Context: Comfort operating at scale across 10K+ network devices and multi-region fabric fleets, where the blast radius of a bug is measured in racks, not ports.
  • Education: Bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience in hyperscale or internet-scale environments

Responsibilities

  • Own the Network Automation Platform: Define the technical roadmap for Crusoe's automation stack, from source of truth and config generation through day-2 operations, telemetry, and closed-loop remediation across our global fleet.
  • Build the Source of Truth: Design and own the authoritative data model (NetBox, Nautobot, or equivalent) that drives all network configuration, validation, and operational state across teams.
  • Architect Intent-Based Configuration Systems: Lead the design and delivery of declarative, model-driven configuration pipelines using Python, Nornir, Ansible, Jinja2, and CI/CD, treating the network as code and making configuration drift impossible.
  • Drive Model-Driven Automation: Own Crusoe's gNMI, OpenConfig, and NETCONF/YANG strategy for telemetry collection, configuration management, and state validation across multi-vendor fabrics (Arista, Juniper, NVIDIA/Mellanox).
  • Build Self-Healing Workflows: Design and ship event-driven, auto-remediation systems that detect faults, correlate telemetry, and resolve known failure modes without human escalation.
  • Define the Observability Platform: Set the technical direction for Crusoe's telemetry, metrics, alerting, and dashboarding stack including Prometheus, Grafana, and streaming gNMI.
  • Influence Architecture for Automability: Partner with Network Architecture to ensure designs are automation-first from day one, deployable, validatable, and operable programmatically at scale.
  • Mentor and Multiply: Provide technical guidance to Staff and Senior engineers. Drive code reviews, design reviews, and platform architecture decisions that raise the engineering bar across the org.

Benefits

  • Competitive compensation and equity packages
  • Restricted Stock Units
  • Paid time off, paid holidays & leave of absence programs
  • Comprehensive health, dental & vision insurance
  • Employer contributions to HSA account
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) Retirement plan with company match up to 4% of salary
  • Volunteer time off
  • Global travel insurance & emergency assistance
  • Daily meals allowance
  • Additional perks & programs specific to location
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service