Mirantis is seeking a commercially driven, deeply technical Product Manager to lead AI inference and model serving for k0rdent AI, their control plane for GPU infrastructure and distributed AI workloads. This role is at the intersection of AI inference, cloud-native infrastructure, distributed systems, and performance engineering. The Product Manager will define how customers deploy, scale, and operate production inference services while optimizing performance from GPU, network, and storage infrastructure. The role encompasses product strategy and solution development for inference products across on-premises, cloud, and edge environments, including serverless inference, dedicated endpoints, workload placement, autoscaling, routing, lifecycle management, observability, and full-stack performance optimization. The goal is to enable customers to run production model-serving workloads at scale, improving latency, throughput, utilization, reliability, cost, and operational control.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed