OneDrive and SharePoint (ODSP) power the world’s most impactful intranets, collaboration experiences, business workflows, and content ecosystems. As AI becomes deeply embedded across these surfaces, from search, Q&A, and summarization to powerful synchronous and autonomous agents, our ability to measure quality, reliability, and safety at scale becomes a strategic advantage. Evaluation, both offline and online, is now the way we build and ship AI.

As a Senior Software Engineer for the Eval Tooling team, you will help shape and deliver the systems for validating, measuring, and improving AI quality across ODSP Experiences. Your mission starts with elevating developer productivity and enabling fast, confident iteration across a broad and rapidly expanding set of AI workloads: RAG, agents, content generation, semantic search, content understanding, and ODSP’s emerging agents that orchestrate multi-step actions across files, lists, and sites. You will also partner with Applied Science and Customer Success teams to scale customer data sets. You will partner closely with evaluation platform and tooling efforts across M365 to both leverage shared capabilities and contribute back to the broader ecosystem: we are One Microsoft. This is a hands-on technical and strategic role where you will define how ODSP Experiences builds trust in AI and ships AI safely, quickly, and confidently.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Job Type
Full-time
Career Level
Mid Level