Mirantis is seeking a commercially driven, deeply technical Product Manager to lead AI inference and model serving for k0rdent AI, their control plane for GPU infrastructure and distributed AI workloads. This role is at the intersection of AI inference, cloud-native infrastructure, distributed systems, and performance engineering. The Product Manager will define how customers deploy, scale, and operate production inference services while maximizing performance from underlying GPU, network, and storage infrastructure. This role is responsible for product strategy and solution development for inference products across on-premises, cloud, and edge environments. The scope includes serverless inference, dedicated endpoints, workload placement, autoscaling, routing, lifecycle management, observability, and full-stack performance optimization. The goal is to define how customers run production model-serving workloads at scale while improving latency, throughput, utilization, reliability, cost, and operational control. The ideal candidate will have experience with high-performance infrastructure products, understand production systems under real-world load, be comfortable reasoning across the full stack, identify performance bottlenecks, evaluate system design trade-offs, and translate technical insights into clear product requirements, architecture direction, and customer-facing solutions.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed