AI developers and researchers today face significant friction in taking trained models into deployment across different types of hardware. They work in a highly fragmented space, with incomplete and patchwork solutions that require integration of many open source components that often times work for one HW vendor but not the other. At Modular, we are building the next generation AI platform that will radically improve the way developers build and deploy AI models. A core part of this offering is MAX - a modern AI inference solution built on the portable Mojo platform for heterogeneous AI acceleration. The Modular platform allows customers to achieve state-of-the-art performance across model families and hardware types. As the leader of the high-caliber MAX team, you will manage an engineering organization specializing in a state of the art inference framework, full stack optimizations including SoTA Kernel on latest GPUs and Accelerators, tools that enable developers to better understand performance, serving optimizations for distributed serving, all to enable frontier models. This organization is building MAX as the next generation framework for AI developers, and revolutionizing AI development and research. This is a fantastic opportunity for a leader to help drive the core technology at Modular! LOCATION: Candidates based in the US or Canada are welcome to apply. You can work in our office in Los Altos, CA or remotely from home. Onboarding for new hires is conducted in-person in our Los Altos, CA office.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Executive
Education Level
No Education Listed