About The Position

Our team builds and operates core Azure Storage infrastructure that powers Blob Storage and ADLS at hyperscale. We focus on distributed systems, performance, scalability, and reliability of the data plane, enabling customers to run mission-critical and AI/ML workloads with predictable latency, high throughput, and strong durability guarantees. The team works end-to-end—from design and implementation to live-site operations—on systems that run at massive scale across Azure storage stamps. A Senior Software Engineer in this role will have direct impact on the performance, reliability, and scale of Azure Storage used by enterprises customers, operating at multiple exabytes and hundreds of millions of requests per seconds. This person will design and implement changes in core request paths, contribute to architecture decisions, and help improve latency, throughput, and fault tolerance of large-scale distributed systems.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, Rust, C# OR equivalent experience.
  • Ability to meet Microsoft customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Nice To Haves

  • Bachelor's Degree in Computer Science OR related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, Rust, C# OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, Rust, C# OR equivalent experience.
  • 3+ years of experience in designing and developing reliable, high-performance distributed systems with a strong focus on scalability and resilience.
  • Demonstrated experience with distributed systems and large-scale storage, including solving complex systems problems and working on storage internals such as availability, remote replication, durability schemes, and high-performance I/O.
  • Practical knowledge in one or more technical domains: database internals, indexing and search techniques, caching and data placement strategies, and metadata/namespace management at scale.
  • Ability to influence technical direction and contribute to architecture decisions across teams, fostering collaboration and innovation.

Responsibilities

  • Reasoning about complex distributed failure modes
  • Making performance vs. reliability trade-offs at scale
  • Debugging production issues in high-traffic, multi-tenant environments
  • Delivering changes safely while operating a 24×7 global service
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service