Senior Research Engineer

Microsoft•Redmond, WA

66d

About The Position

Copilot usage is growing across Microsoft 365 and custom agent experiences. To keep pace with diverse customer needs, regulatory requirements, and rapid innovation in the model ecosystem, we’re expanding our model choice across multiple providers and modalities. A robust, data‑driven evaluation and observability platform ensures we select the right model for each scenario—balancing quality, safety, latency, and cost—and de‑risks vendor lock-in while increasing resilience and agility. Within Microsoft, our teams already compare models on capabilities, cost, and latency and visualize responsible AI metrics (e.g., groundedness, coherence, relevance, similarity) in integrated dashboards—this role accelerates and productizes those patterns for Copilot Studio makers and platform teams. You will build the backend systems, APIs, and evaluation pipelines that let Copilot and Copilot Studio safely and efficiently route requests across multiple model providers. You’ll partner with platform PMs, applied scientists, and reliability engineers to instrument end‑to-end quality signals, govern rollouts, and create decisioning frameworks that map model/provider selection to Copilot core use cases (authoring, reasoning, retrieval‑augmented generation, multi‑agent orchestration, and domain-specific tasks). Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice To Haves

Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Experience building backend services or APIs in a production environment using at least one modern programming language (e.g., C#, Java, Python, or TypeScript).
Experience working with large language models (LLMs) or similar AI systems in any capacity (development, integration, or evaluation).
Experience deploying or operating distributed systems or cloud-based services (e.g., Azure, AWS, or GCP) in a production environment.

Responsibilities

Build multi provider model routing services with policy driven selection, traffic splitting, and observability.
Develop evaluation pipelines for offline and online experiments; automate metrics like task success, groundedness, latency, and cost.
Operationalize model comparisons through scorecards, dashboards, and A/B testing for rollout decisions.
Harden backend infrastructure for scale, security, and compliance; implement caching, rate limiting, and safety filters.
Instrument telemetry and monitoring using OpenTelemetry; enable RCA and performance insights.
Collaborate cross functionally with PMs, scientists, and Responsible AI teams; document and evangelize best practices.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume