Senior Engineering Manager, Compute

CrusoeSan Francisco, CA
1d$237,000 - $288,000Onsite

About The Position

At Crusoe, we are on a mission to align the future of computing with the future of the climate. As a Senior Engineering Manager on the Compute Team, you will lead the engineers responsible for our vertically integrated AI cloud. This team sits at the intersection of high-performance hardware and cloud-native software, ensuring that our GPU clusters—powered by stranded and renewable energy—deliver world-class performance and reliability for the world’s most demanding AI and HPC workloads. You will manage a high-caliber team of systems and software engineers focused on virtualization, bare-metal provisioning, kernel-level optimization, VM as a Service, and Cloud Hypervisor Development, Open Source contributions. As we look to rapidly scale, your team’s code directly impacts the performance-per-dollar for the world’s leading AI Enterprises. You are building a leaner, faster, and more specialized cloud from the ground up. Your leadership will directly influence how Fortune 500 companies and leading AI researchers access sustainable, hyperscale compute power.

Requirements

  • Leadership Experience: 5+ years of experience in engineering management, specifically leading teams that build distributed systems, cloud infrastructure, or high-performance computing platforms.
  • Technical Depth: A strong background in systems programming (Go, C/C++, or Rust) and a deep understanding of Linux internals and virtualization technologies.
  • Execution at Scale: Proven ability to lead teams through ambiguity and deliver mission-critical software in a fast-paced, high-growth environment.
  • Strategic Mindset: You can bridge the gap between low-level technical trade-offs and high-level business goals, clearly communicating complex concepts to stakeholders.
  • Passion for Sustainability: A genuine interest in Crusoe’s mission to reduce the environmental impact of the AI revolution.

Nice To Haves

  • Experience at a major Cloud Service Provider (CSP) or in a high-scale AI infrastructure company.
  • Familiarity with GPU-based workloads, InfiniBand, or RoCE networking.
  • Contributions to open-source projects in the Linux kernel or virtualization space.

Responsibilities

  • Team Leadership & Growth: Hire, mentor, and scale a world-class team of engineers. You will define performance expectations, foster a culture of technical excellence, and build career growth paths for your direct reports.
  • Compute Infrastructure Strategy: Lead the development and optimization of Crusoe’s compute stack, from bare-metal orchestration to hypervisor tuning (KVM/QEMU) and kernel subsystems (NUMA, memory management, scheduling).
  • High-Performance AI Optimization: Collaborate with hardware and networking teams to optimize performance for massive GPU/TPU clusters, SmartNICs, and high-speed interconnects.
  • Operational Excellence: Oversee the reliability and scalability of our compute services. You will guide the team through complex distributed systems challenges and ensure high availability across our global data center footprint.
  • Cross-Functional Roadmap: Partner with Product, Infrastructure, and Site Reliability Engineering (SRE) to define and execute a roadmap that balances rapid innovation with the stability of a "gold standard" cloud provider.

Benefits

  • Industry competitive pay
  • Restricted Stock Units in a fast-growing, well-funded technology company (Series E)
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company paid commuter benefit; $300 per month
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service