As a Member of Technical Staff at Fluidstack, you will design, develop, and maintain software solutions that power our AI infrastructure and enable our customers to run complex ML workloads efficiently at scale. Your responsibilities are aligned with the success of our customers and your teammates, and you'll work side-by-side with them to push forward the state of the art in AI/ML. A day's work may include: Developing and optimizing job scheduling systems to maximize GPU utilization and throughput for ML workloads, Building and improving software interfaces for cluster management that support PyTorch, JAX, and other ML frameworks, Creating monitoring and observability tools for tracking training progress, resource usage, and system performance, Implementing data pipeline optimizations to accelerate training and inference workflows, Designing and developing APIs and services to integrate with MLflow, Kubeflow, Weights & Biases, and other ML tooling, Writing libraries and utilities to simplify the deployment and management of distributed training jobs.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level