The Microsoft Azure Artificial Intelligence/High Performance Computing (AI/HPC) team is seeking software engineers to help customers deploy, monitor, profile, and debug their applications on hyperscale cloud infrastructure. Azure supports the largest supercomputing deployments for complex computational problems in the public cloud, and its HPC products are recognized on the Top500, MLPerf, and Graph500 rankings. At this scale, specialized tools and techniques are crucial for maintaining the reliability, runtime performance, system health, and job execution needed to meet customer Service Level Agreements (SLAs). The role involves building and using state-of-the-art cloud applications and services to identify operational gaps and implement features for the smooth operation and management of cloud-native supercomputers. The Senior Supercomputing Engineer's responsibilities also include establishing best practices, driving architectural changes, and influencing the roadmap of relevant software and hardware components. This work directly impacts the business goals of a wide range of users and fosters growth and innovation in AI and HPC in the cloud. Microsoft's mission is to empower every person and organization to achieve more, supported by a culture of inclusion, respect, integrity, and accountability.
Job Type: Full-time
Career Level: Senior