About The Position

We are seeking experienced engineers to help build cloud‑native, open‑source AI frameworks and platforms that power AI/ML training, fine‑tuning, inference, and agentic applications at scale. This role focuses on designing and implementing Kubernetes‑native abstractions and operators that make advanced AI workloads reliable, scalable, and easy for developers to consume across cloud and hybrid environments. You will contribute to and help lead work in upstream open‑source communities while shaping and building production‑grade AI platforms used by internal teams and external customers. The ideal candidate has hands‑on experience building or operating AI/ML training, fine‑tuning, and inference platforms in cloud‑native environments, and is passionate about shaping the future of cloud‑native AI infrastructure through open collaboration. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Nice To Haves

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, or Python OR equivalent experience.
  • Hands‑on experience building or operating AI/ML training, fine-tuning, and inference platforms in cloud‑native environments.
  • Proficiency with Go and/or Python for building platform components, Kubernetes operators/controllers, and integrations in production environments.
  • Demonstrated experience contributing to or maintaining open‑source software, especially in the Kubernetes, AI/ML, or cloud‑native ecosystem.
  • #azurecorejobs

Responsibilities

  • Design, implement, and maintain Kubernetes operators and controllers for AI/ML workloads
  • Partner with product managers, business stakeholders, and users to understand user pain points deeply and create innovative solutions that delight your customers in an agile development environment.
  • Contribute to applicable upstream open-source projects
  • Write technical design documents and participate in architecture reviews
  • Mentor team members and external contributors through code reviews
  • Debug and optimize distributed AI systems running at scale
  • Strive for excellence in everything you do: culture, collaboration, process, tools, design, engineering practices, customer experience, performance, security etc.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service