Software Engineer II - Responsible AI

Microsoft•Washington, DC

20h

About The Position

Come build the core of Microsoft Copilot for enterprise with the Microsoft Turing team, where you’ll join a collaborative group pushing the frontier of large language models (LLMs) to improve the productivity of hundreds of millions of users around the world. Turing is responsible for the core systems that power Microsoft 365 Copilot Chat, delivering intelligence, quality, and transformative new features. We work at the intersection of research and engineering—advancing orchestrator reasoning, training next-generation models, and shipping impactful, model-driven experiences in Microsoft 365 Copilot. The Responsible AI group focuses on identifying, measuring, mitigating, and monitoring Responsible AI risks in AI-generated content spanning text, image, audio, video, and multimodal content. We are looking for a Software Engineer II who is passionate about building scalable systems and services that support Responsible AI evaluation, measurement, and mitigation across Copilot scenarios. In this role, you will work with an interdisciplinary group of engineers, applied scientists, linguists, and product managers to ensure that the innovative products and machine learning solutions Turing delivers to tens of millions of users every month are safe and reliable. You will help build and operate the technical foundations that enable Responsible AI at scale. This includes designing and implementing evaluation infrastructure as well as monitoring and mitigation tooling that integrates deeply with Microsoft 365 Copilot’s development lifecycle. Your work will have a direct impact on product quality, user trust, and the safety of AI systems used by millions of people every day. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day, we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Requirements

Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice To Haves

Master's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 5+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Bachelor's Degree or Minor in Computational Social Science (e.g., Computational Linguistics) or equivalent experience
2+ years of experience working on Responsible AI, AI safety, or closely related area (e.g., ML fairness, harm evaluation, safety mitigations).
Demonstrated ability to collaborate effectively across engineering, applied science, and interdisciplinary partner teams to deliver shared engineering outcomes.
Demonstrated software design, problem-solving, and data analysis skills, with demonstrated interest in Responsible AI, and a commitment to quality, performance, and engineering standard practices

Responsibilities

Design and build end‑to‑end (E2E) experiences that support the identification, measurement, and mitigation of Responsible AI (RAI) risks across M365 Copilot features, including (but not limited to) mainline chat, agents, chat history, and model iterations.
Develop and operate RAI evaluation infrastructure, including dashboards and tooling for monitoring safety telemetry, system performance, and error patterns. Create alerts and participate in live‑site support for safety‑critical systems.
Drive scalability and consistency of RAI evaluation techniques by extending methods and frameworks across diverse systems, features, and use cases.
Collaborate closely with partner teams and across disciplines to build, refine, and debug E2E experiences, proactively identifying, triaging, and root‑causing failures and safety regressions.
Continuously invest in technical growth, staying current with industry and internal practices to improve availability, reliability, safety, efficiency, observability, and performance.
Embody our culture and values.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume