As part of the Cluster Orchestration team, you will play a key role in advancing CoreWeave’s orchestration platform including SUNK (Slurm on Kubernetes) and beyond our Kubernetes-native foundation that powers AI training and inference at scale. This is an opportunity to help shape one of the most critical layers of the AI cloud: ensuring workloads run seamlessly, reliably, and efficiently across massive GPU clusters. By building the systems that eliminate infrastructure bottlenecks and create new orchestration capabilities, you will directly empower customers to innovate faster and push the boundaries of what’s possible with AI. About the role: As a Senior Software Engineer I (IC3), you will own multiple services within the orchestration platform. You’ll lead design/code reviews, decompose projects into milestones, and drive measurable improvements in reliability and performance. You’ll define SLIs/SLOs for your services, strengthen operational practices, and mentor IC1/IC2 engineers. Your work will ensure customers see consistent improvements in throughput, latency, and system resilience.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
1,001-5,000 employees