About The Position

The AI Frameworks team at Microsoft develops AI software that enables running AI models everywhere, from the world’s fastest AI supercomputers to servers, desktops, mobile phones, IoT devices, and internet browsers. We collaborate with our hardware teams and partners to build the software stacks for novel AI accelerators. We work closely with ML researchers and developers to optimize and scale out model training and inference. The team operates at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. We are a cross-discipline team of highly capable and motivated people with a collaborative and inclusive culture.

As a Principal Software Engineer - Simulation Platform, you will be responsible for designing, implementing, and ensuring the quality of the AI chip simulator and related tools, as well as their integration and timely delivery as part of the software development kit (SDK). This is a technical role that demands hands-on experience in all phases of software development: concept, software design, implementation, and verification. An entrepreneurial mindset and quick adaptation to innovative ideas and concepts are critical for fast ramp-up and contribution to a large-scale project. Knowledge of computer architecture and/or embedded programming is essential for successful candidates.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C++, C, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice To Haves

  • Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C++, C, or Python OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C++, C, or Python OR equivalent experience.
  • 3+ years of experience in systems programming (C, C++, Rust, or similar), with a focus on low-level or performance-critical software
  • Solid understanding of memory models, concurrency, and interprocess communication
  • Experience working on infrastructure involving hardware interfaces or device communication (e.g., PCIe, DMA, RDMA, or similar)
  • Familiarity with Graphics Processing Unit (GPU), Tensor Processing Unit (TPU), or other accelerator architectures and their runtime systems
  • Experience implementing communication protocols or working with driver/kernel interfaces
  • Exposure to observability or profiling tools (e.g., eBPF, trace buffers, performance counters, telemetry hooks)

Responsibilities

  • Develop a hardware simulator for next-generation AI chips
  • Contribute technically to the design, implementation, verification, and documentation of code (C++ and Python), ensuring on-time delivery of simulator releases used daily by partner teams
  • Collaborate broadly across multiple disciplines and with various partner teams, from hardware designers to AI model developers
  • Identify requirements, scope solutions, estimate work, and schedule deliverables