Senior Software Engineer, Storage Engine

CoreWeaveBellevue, WA
1d$139,000 - $204,000Hybrid

About The Position

The Storage Engine Organization at CoreWeave is responsible for the product capabilities and data plane function of CoreWeave’s managed storage products. We build reliable, scalable storage solutions with segment leading performance. Storage engine works with engineering teams across infrastructure, compute, and platform to ensure our storage services meet the needs of the world’s most demanding AI workloads. About the role: Design and Implement distributed storage solutions to support scaling data intensive AI workloads. Contribute to the development of exabyte-scale, S3-compatible object storage and integrate dedicated storage clusters into diverse customer environments. Work with technologies such as RDMA, GPU Direct Storage, and distributed filesystems protocols such as NFS or FUSE to optimize storage performance and efficiency. Participate in efforts to improve the reliability, durability, and observability of our storage stack. Collaborate with operations teams to monitor, troubleshoot, and improve storage systems in production environments. Help develop metrics and dashboards to provide visibility into storage performance and health. Build proactive automation to ensure our systems have a consistent performance envelope for customer workloads. Analyze telemetry and system data to drive improvements in throughput, latency, and resilience. Work cross-functionally with platform, product, and infrastructure teams to deliver seamless storage capabilities across the stack. Share your knowledge and mentor other engineers on best practices in building distributed, high-performance systems.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • 4–8 years of experience working in storage systems engineering or infrastructure.
  • Strong hands-on experience with object storage or distributed filesystems in production environments.
  • Experience with one or more storage protocols (e.g. S3, NFS) and file systems such as Ceph, DAOS, or similar.
  • Proficiency in a systems programming language such as Go, C, or Rust.
  • Familiarity with storage observability tools and telemetry pipelines (e.g., ClickHouse, Prometheus, Grafana).
  • Experience working with cloud-native infrastructure, Kubernetes, and scalable system architecture.

Nice To Haves

  • You love to grow and push the boundaries of your expertise
  • You’re curious about distributed storage and the demands AI/ML workloads
  • You’re an expert in data persistence on physical media, high performance data transfer using RDMA, or resilient distributed systems.

Responsibilities

  • Design and Implement distributed storage solutions to support scaling data intensive AI workloads.
  • Contribute to the development of exabyte-scale, S3-compatible object storage and integrate dedicated storage clusters into diverse customer environments.
  • Work with technologies such as RDMA, GPU Direct Storage, and distributed filesystems protocols such as NFS or FUSE to optimize storage performance and efficiency.
  • Participate in efforts to improve the reliability, durability, and observability of our storage stack.
  • Collaborate with operations teams to monitor, troubleshoot, and improve storage systems in production environments.
  • Help develop metrics and dashboards to provide visibility into storage performance and health.
  • Build proactive automation to ensure our systems have a consistent performance envelope for customer workloads.
  • Analyze telemetry and system data to drive improvements in throughput, latency, and resilience.
  • Work cross-functionally with platform, product, and infrastructure teams to deliver seamless storage capabilities across the stack.
  • Share your knowledge and mentor other engineers on best practices in building distributed, high-performance systems.

Benefits

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service