IT Storage Engineer

SpaceXHawthorne, CA
111d$120,000 - $160,000

About The Position

SpaceX is seeking an experienced storage engineer with deep expertise in both enterprise storage platforms and high-performance computing (HPC) storage environments. This role is ideal for someone who thrives in performance-critical settings and is eager to build storage infrastructure supporting mission-critical workloads like simulation, telemetry, and test data processing. The ideal candidate will possess a hybrid skill set—combining enterprise storage experience with HPC expertise, particularly in designing low-latency RoCEv2 and InfiniBand storage networks and deploying parallel file systems optimized for scale.

Requirements

  • Experience with enterprise storage platforms: NetApp, Pure Storage, Scality, Vast.
  • Strong understanding of storage protocols: NFS, CIFS/SMB, iSCSI.
  • 5+ years of experience in IT storage engineering.
  • Experience with client and server hardware/software, monitoring tools, enterprise networking, and virtualization.

Nice To Haves

  • Mid-level experience with DFS administration.
  • Deep knowledge of networking including RDMA, congestion control tuning, and lossless Ethernet.
  • Experience automating infrastructure with Python, PowerShell, or Ansible.
  • Familiarity with HPC schedulers (e.g., Slurm, LFS) and how storage tiers interact with job scheduling.
  • 2+ years of experience deploying and supporting RoCEv2 or InfiniBand-based storage infrastructure in HPC or latency-sensitive environments.
  • Strong written and verbal communication skills; able to explain complex designs in clear terms.
  • Experience collaborating with virtualization teams as an infrastructure engineer.

Responsibilities

  • Design, implement, and maintain storage infrastructure across multiple tiers: high-performance, capacity, archive, backup/recovery, and disaster recovery.
  • Architect and operate high-throughput, low-latency storage fabrics using RoCEv2 and/or InfiniBand for compute-intensive environments.
  • Engineer and support parallel file systems (e.g., VAST, Lustre, BeeGFS) for high concurrency workloads in simulation and telemetry analysis.
  • Administer storage systems including NetApp, Pure FlashArray/FlashBlade, Cohesity, and Scality.
  • Monitor and troubleshoot storage performance across the full stack: hardware, transport (RDMA), and protocol layers.
  • Troubleshoot complex issues that require an understanding of the interaction of various protocols and operating systems.
  • Manage storage access, data permissions, and access controls across different protocols and operating systems.
  • Develop automation and tooling for monitoring, capacity utilization and planning, configuration management, and alerting.
  • Define and enforce standards for storage deployment, performance tuning, and operational management.
  • Collaborate with compute, networking, and application teams to align storage strategies with system architecture and workload requirements.
  • Participate in scheduled maintenance, on-call rotation, and occasional physical installation of hardware.

Benefits

  • Pay Range: $120,000.00 - $160,000.00/year
  • Long-term incentives, in the form of company stock, stock options, or long-term cash awards.
  • Potential discretionary bonuses.
  • Ability to purchase additional stock at a discount through an Employee Stock Purchase Plan.
  • Comprehensive medical, vision, and dental coverage.
  • Access to a 401(k) retirement plan.
  • Short and long-term disability insurance.
  • Life insurance.
  • Paid parental leave.
  • Various other discounts and perks.
  • 3 weeks of paid vacation.
  • 10 or more paid holidays per year.
  • 5 days of sick leave for exempt employees.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service