Software Engineer, Control Plane

CrusoeSan Francisco, CA
$136,000 - $160,500

About The Position

We are seeking a Software Engineer, Control Plane to help build and scale the next generation of Crusoe Cloud’s Control Plane. As a member of the Compute team, you will contribute to the high-availability systems that manage our global fleet of AI-optimized compute, network, and storage resources. In this role, you will transition from executing defined tasks to taking ownership of specific services and features within our IaaS platform. You will ensure that as Crusoe scales to meet massive AI demand, our control plane remains fault-tolerant, performant, and seamlessly integrated with our underlying high-performance hardware.

Requirements

  • Feature Implementation: Write high-quality, production-ready code in Go to handle resource state transitions and API orchestration.
  • Design for Reliability: Contribute to design docs for multi-region control plane services, focusing on edge cases and failure modes.
  • Scaling & Optimization: Help identify and resolve bottlenecks in our resource orchestration layer using tools like Kubernetes, Go, and Postgres.
  • Production Support: Work alongside SREs to investigate production issues, perform root cause analysis, and implement long-term fixes to harden our environment.
  • 3+ years of software development experience with proficiency in a modern compiled language—Go is highly preferred, but Rust or C++ are also valued.
  • Experience building and deploying distributed systems or managed cloud services in a production environment.
  • Working knowledge of the modern infrastructure stack, including Kubernetes, Docker, Terraform, and Postgres.
  • A solid understanding of how cloud resources (Compute, Network, Storage) are abstracted and managed.
  • A "product-minded" approach to engineering, with the ability to understand how technical decisions impact the end-user experience.
  • Strong communication skills and the ability to explain technical trade-offs within your immediate team.

Responsibilities

  • Develop and Maintain Microservices: Build and optimize the scalable services that power the Crusoe Cloud control plane and manage complex virtualized resource lifecycles.
  • Implement IaaS Primitives: Contribute to the backend logic that ensures high throughput and low-latency API responses for large-scale cluster provisioning.
  • Cross-Functional Collaboration: Partner with Product, Networking, and Storage teams to implement features that create differentiated cloud solutions for AI/ML customers.
  • Operational Excellence: Improve the observability, testing coverage, and reliability of our distributed systems through clean code and robust CI/CD practices.
  • Technical Growth: Participate in rigorous design reviews, contribute to team documentation, and help onboard new engineers to the team’s codebase.

Benefits

  • Competitive compensation and equity packages
  • Restricted Stock Units
  • Paid time off, paid holidays & leave of absence programs
  • Comprehensive health, dental & vision insurance
  • Employer contributions to HSA account
  • Paid parental leave
  • Paid life insurance, short-term and long-term disability
  • Professional development & tuition reimbursement
  • Mental health & wellness support
  • Commuter benefits (parking & transit)
  • Cell phone stipend
  • 401(k) Retirement plan with company match up to 4% of salary
  • Volunteer time off
  • Global travel insurance & emergency assistance
  • Daily meals allowance
  • Additional perks & programs specific to location

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

251-500 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service