About The Position

The Gen AI Infrastructure and Solutions team is building large-scale GenAI training infrastructure, LLM-based solutions and tools. We provide the infrastructure for teams in CoreAI and other Microsoft Groups to fine-tune LLMs for their own scenarios. We also build services and solutions of SLMs and LLMs for 1P&3P on Azure, like Azure Machine Translation service. As a Senior Software Development Engineer - Gen AI Infrastructure and Solutions, you will work on the infrastructure and tools to support large scale model fine-tuning, and evaluation. You will also work on the service to host SLMs/LLMs for translation to process trillions of characters per month. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • 3+ years designing, developing, and shipping software.
  • 2+ years of experience with distributed systems and cloud-based infrastructure.
  • 1+ year of experience with DevOps practices (CI/CD, automated testing, deployment, etc.).
  • Ability to meet Microsoft, customer and/or government security screening requirements.

Nice To Haves

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • 1+ year of experience with containerization tools (e.g., Docker, Kubernetes).
  • Familiarity with production ML systems and concepts like model serving, caching, batching, and monitoring.

Responsibilities

  • Collaborate with senior engineers and researchers to build and optimize training infrastructure and tools for LLMs, SLMs, multimodal, and code-specific models.
  • Design and implement new AI features for Azure Machine translation and Language services.
  • Design, build and improve the services with high scalability and reliability.
  • Contribute to the deployment and monitoring of services in production environments.
  • Participate in the efforts to deliver and improve engineering systems and practices to ensure service quality in complex cloud environments.

Benefits

  • Industry leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Professional, Scientific, and Technical Services

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service