Principal Applied Scientist (CoreAI)

MicrosoftRedmond, WA
3d

About The Position

Joining the CoreAI organization at Microsoft means becoming part of the team that builds the end-to-end AI stack powering Azure’s innovation. As a member of the GenAI Infra and Solutions team within CoreAI, you will help develop the AI infrastructure that accelerates the creation of agentic AI systems across Microsoft. This role is dedicated to advancing scientific methods and scalable infrastructure for training agentic models to achieve frontier-level performance. You will contribute to LLMs, SLMs, and agentic models using both proprietary and open-source frameworks, all aimed at delivering reliable, enterprise-grade agentic workflows.

Requirements

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
  • OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role.
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years

Nice To Haves

  • 5+ years of coding experience in Python and experience with ML frameworks such as PyTorch and Triton
  • 3+ years experience of large-scale model training for LLMs, SLMs, and agentic models
  • 3+ years of proven ability to design and scale training infrastructure and pipelines in production environments
  • Experience with agent training frameworks
  • Experience with large-scale distributed training and/or serving with demonstrated ability to dive deep into complex systems, troubleshoot unconventional issues, and craft innovative solutions under real-world constraints
  • Extensive experience with large-scale training, model inference, reinforcement learning, and reasoning models

Responsibilities

  • Write efficient, production-quality code and debug complex training jobs
  • Build and maintain training pipelines and architectures across both proprietary and open-source frameworks
  • Collaborate effectively within interdisciplinary teams and communicate complex research concepts in clear, actionable ways
  • Document findings and insights to enable effective cross-team collaboration and knowledge sharing
  • Drive innovations that power flagship Microsoft products and services

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service