About The Position

Hire, manage, and grow a high-performing team of software engineers, fostering a culture of excellence, inclusion, and innovation. Lead the design and development of large-scale distributed systems and services that power Azure's AI infrastructure. Drive engineering planning and execution while ensuring alignment with organizational OKRs and long-term strategy. Establish lean, scalable, and efficient processes that promote innovation and engineering rigor. Deliver best-in-class engineering by ensuring services and components are modular, secure, reliable, diagnosable, observable, and reusable. Improve test coverage, automation, and integration testing to proactively identify and resolve reliability gaps. Ensure live-site reliability and service health through robust monitoring, telemetry, and automation.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • 2+ years people management experience.
  • Preferred: Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Preferred: 6+ years people management experience.

Responsibilities

  • Hire, manage, and grow a high-performing team of software engineers
  • Lead the design and development of large-scale distributed systems and services that power Azure's AI infrastructure
  • Drive engineering planning and execution while ensuring alignment with organizational OKRs and long-term strategy
  • Establish lean, scalable, and efficient processes that promote innovation and engineering rigor
  • Deliver best-in-class engineering by ensuring services and components are modular, secure, reliable, diagnosable, observable, and reusable
  • Improve test coverage, automation, and integration testing to proactively identify and resolve reliability gaps
  • Ensure live-site reliability and service health through robust monitoring, telemetry, and automation