At Modular, we optimize inference from kernel to cloud on one unified stack. We are building a differentiated cloud platform that delivers state-of-the-art inference performance from day one, then keeps getting better. As we learn the shape and patterns of each customer's workload, the platform adapts and improves performance automatically over time.

The Performance Labs team builds the infrastructure that makes this possible at scale. We continuously apply the latest optimizations across kernels, the inference engine, and distributed systems so that customer workloads stay on the Pareto frontier of cost and performance. We get there through deep workload insights, a scalable platform, and close collaboration with engineering and product teams.

In this role, you will lead a high-impact team that partners closely with GTM, Product, and Engineering to redefine what an inference platform can be. You will turn real customer workloads into a continuous optimization loop, shape the product direction of Modular Cloud, and build the systems that let performance scale with demand.
Job Type: Full-time
Career Level: Manager
Education Level: Not listed