Develop and execute an LLM platform strategy for Copilot that extends language models' capabilities. Prototype approaches that steer language models to improve response quality across a wide range of scenarios. Identify and prioritize platform, orchestration, and language model issues that impact quality, factuality, and safety, and work with engineers and researchers to find a path to resolution. Define and build measurable evaluations with relevant datasets to demonstrate quality improvements. Define, deploy, and manage experiments in production that affect language models' tool use, driving measurable improvements in relevance for, and engagement with, Copilot users. Partner with product teams to scale tool building, and work with inference, agents, and orchestration teams to resolve dependencies. Own the status of key projects, proactively identifying risks and proposing solutions to ensure timely delivery. LLMs will be your clay: you should know how to prompt-engineer them, how to evaluate them, and how they are tuned.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees