The Artificial Intelligence Performance team at Microsoft develops AI software that enables running AI models everywhere, from world’s fastest AI supercomputers, to servers, desktops, mobile phones, IoT devices and internet browsers. They collaborate with hardware teams and partners, operating at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. The team prides itself on a collaborative and inclusive culture. They own inference performance of OpenAI and other state-of-the-art LLM models, working directly with OpenAI on models hosted on the Azure OpenAI service, serving massive workloads with trillions of inferences per day in major Microsoft products. As a Senior Software Engineer - Performance, you will work on multiple levels of the AI software stack, including fundamental abstractions, programming models, compilers, runtimes, libraries, and APIs to enable large-scale training and inferencing of models. Responsibilities include benchmarking OpenAI and other LLM models for performance on GPUs and Microsoft HW, debugging and optimizing performance, monitoring performance, and enabling rapid deployment of these models with minimal hardware, contributing to Microsoft Azure's capex goals. This is a hands-on technical role requiring software design and development skills, seeking someone with a demonstrated history of solving technical problems and motivation to tackle challenges in building a full end-to-end AI stack. An entrepreneurial approach, initiative, and speed are essential. Microsoft's mission is to empower every person and every organization to achieve more, fostering a culture of inclusion based on values of respect, integrity, and accountability.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior