Staff Storage Systems Architect

LambdaSan Jose, CA
1dOnsite

About The Position

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU. If you'd like to build the world's best AI cloud, join us. Note: This position requires presence in our San Francisco or San Jose office location 4 days per week; Lambda’s designated work from home day is currently Tuesday. We're looking for a Storage Systems Architect experienced in designing, planning, testing, and implementing large-scale distributed storage systems. You will play a critical role in defining our storage infrastructure strategy, driving technical solutions that ensure scalability, reliability, and efficiency for our growing business and operational needs. You will collaborate closely with engineering teams, infrastructure operations, and product stakeholders to ensure our storage solutions align with company objectives and technical requirements.

Requirements

  • Proven experience (7+ years) designing and implementing distributed storage systems.
  • Deep expertise with distributed storage technologies (e.g., Ceph, Lustre, or similar).
  • In-depth knowledge of underlying hardware systems supporting storage deployments.
  • Strong understanding of storage architectures, data replication, redundancy strategies, and performance optimization techniques.
  • Experience working with high-performance and high-availability storage solutions.
  • Familiarity with object, block, and file storage protocols.
  • Ability to identify, analyze, and resolve complex technical issues.
  • Excellent communication skills, capable of clearly articulating technical concepts to diverse stakeholders.

Nice To Haves

  • Expertise in designing, implementing, and managing both enterprise-grade and open-source storage systems.
  • Experience with cloud storage solutions (AWS S3, Azure Blob Storage, Google Cloud Storage).
  • Familiarity with container storage solutions and orchestration.
  • Experience managing storage in large-scale cloud environments.
  • Exposure to data security, compliance, and regulatory requirements (e.g., GDPR, HIPAA).

Responsibilities

  • Architect, design, and implement distributed storage solutions optimized for AI workloads.
  • Drive the development of storage system standards and best practices to ensure consistency and reliability across infrastructure.
  • Evaluate and benchmark storage technologies to meet demanding AI performance requirements.
  • Collaborate with engineering teams to integrate storage solutions with cloud product offerings.
  • Define and maintain storage capacity plans, performance metrics, and scalability roadmaps.
  • Drive operational excellence, ensuring high availability, disaster recovery, and data integrity.
  • Provide mentorship and technical leadership to storage and infrastructure engineers.

Benefits

  • Health, dental, and vision coverage for you and your dependents
  • Wellness and commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible paid time off plan that we all actually use

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

1-10 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service