Staff Software Engineer, Model LifeCycle

Crusoe
San Francisco, CA
$204,000 - $247,000

About The Position

Crusoe's mission is to accelerate the abundance of energy and intelligence. We're crafting the engine that powers a world where people can create ambitiously with AI without sacrificing scale, speed, or sustainability. Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that's setting the pace for responsible, transformative cloud infrastructure.

About this role: The Staff Software Engineer for the Model LifeCycle team will play a key role in building a comprehensive managed platform for the entire application development lifecycle, with a specific focus on leveraging Machine Learning models, including Large Language Models (LLMs).

Requirements

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 8-10+ years of industry experience, with a demonstrated history of consistent success leading a varied portfolio of initiatives across your function.
  • Proven track record of delivering production features on time.
  • Experience using cloud-based services such as elastic compute, object storage, virtual private networks, and managed databases.
  • Experience with Generative AI (Large Language Models, Multimodal).
  • Experience with AI infrastructure, including training and inference.
  • Proactive and collaborative approach with the ability to work independently.
  • Strong communication and interpersonal skills.
  • Passion for building cutting-edge AI products and solving challenging technical problems.

Nice To Haves

  • Proficiency in Golang or Python for large-scale, production-level services.
  • Experience contributing to open-source AI projects.
  • Experience with performance optimizations on GPU systems and inference frameworks.
  • Experience working with PyTorch.
  • Experience with training and fine-tuning LLMs.

Responsibilities

  • Contribute to fine-tuning systems for large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling.
  • Implement and maintain end-to-end training pipelines for Large Language Models.
  • Contribute to distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling).
  • Develop and maintain agent execution infrastructure.
  • Implement features for dataset, model, and experiment management: versioning, lineage, evaluation, and reproducible fine-tuning at scale.
  • Work closely with Principal Engineers, product, business, and platform teams to implement the core abstractions and APIs of the system.
  • Contribute to architectural decisions around training runtimes, scheduling, storage, and model lifecycle management.
  • Engage with the open-source LLM ecosystem.
  • This role offers significant scope for ownership — you'll be implementing and contributing to the design of core systems.

Benefits

  • Industry-competitive pay
  • Restricted Stock Units in a fast-growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company-paid commuter benefit: $300/month
  • Compensation will be paid in the range of $204,000 - $247,000 plus bonus. Restricted Stock Units are included in all offers. Compensation is determined by the applicant's knowledge, education, and abilities, as well as internal equity and alignment with market data.
  • Crusoe is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, disability, genetic information, pregnancy, citizenship, marital status, sex/gender, sexual preference/orientation, gender identity, age, veteran status, national origin, or any other status protected by law or regulation.