Systems Engineer, Kernel (Networking)

CoreWeaveNew York City, NY
14h$153,000 - $242,000Hybrid

About The Position

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com . Senior Systems Engineer, Kernel Networking CoreWeave is seeking a specialized Kernel Networking Engineer to join our HAVOCK Team. In this role, you will be the subject matter expert for the networking subsystem of CoreWeave’s Linux-based infrastructure. As we scale our massive AI/HPC clusters, you will focus on optimizing the datapath, tuning the TCP/IP and RDMA stacks, and ensuring the stability of high-throughput workloads across NVIDIA, Mellanox, and Broadcom hardware. H ardware - A cceleration - V irtualization - O perating Systems - C ontainerization - K ubelet Our Team’s Stack: Python, Go, bash/sh, C Custom Linux Kernel, Ubuntu Debug Tools: crash, kdump, drgn, gdb Prometheus, Victoria Metrics, Grafana, Loki Docker, kubernetes (k8s), KubeVirt Focus Areas: Holistic Troubleshooting – Act as the first line of defense for complex system crashes, soft lockups, and kernel panics. Cross-Domain Debugging – Identify whether a root cause lies in memory management, storage, or the network layer. Incident Response – Reduce "Mean-Time-To-Resolution" by quickly analyzing crash dumps and stack traces. Reliability Engineering – Contribute to the "Smarter Triaging" initiative to automate crash analysis. Fleet Stability – Ensure kernel support across diverse hardware (CPUs, GPUs, DPUs).

Requirements

  • 5+ years of experience in systems-level development or kernel engineering.
  • Broad Kernel Knowledge: Solid grasp of memory management, scheduling, and filesystems.
  • Networking Fluency: Proven record troubleshooting RoCE, IB, and RDMA issues.
  • Debugging Mastery: Expert capability with standard utilities and a systematic approach to root-cause analysis.
  • Excellent verbal and written communication skills (ability to explain complex kernel bugs to stakeholders).

Nice To Haves

  • Experience with eBPF for troubleshooting.
  • Knowledge of GPU/NVLink architectures.
  • Experience working with automated monitoring/alerting systems (Grafana, Jira automation).
  • Willingness to present at conferences (LPC, LSFMMBPF).

Responsibilities

  • Analyze kernel crashes, oopses, and panics across the entire stack.
  • Apply specific networking knowledge to troubleshoot issues with NVIDIA/Mellanox/Broadcom NICs.
  • Utilize crash dump analysis (kdump, crash, drgn) to triage issues affecting customer workloads.
  • Improve documentation and RCA processes for kernel failures.
  • Assist in maintaining kernel builds and CI/CD pipelines to streamline testing.

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

1,001-5,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service