About The Position

We are looking for a creative and highly motivated Site Reliability Engineer to join our team. You will need both depth and breadth of knowledge of physical infrastructure in a large-scale distributed environment, along with experience in Unix systems administration, DevOps, and data center infrastructure. If you are passionate about solving complex problems at scale, we want to hear from you!

The Systems and Infrastructure team builds and manages world-class services and physical infrastructure that Apple software engineers worldwide use to build, test, and release Apple's software. We are a team dedicated to engineering excellence, reusable design, and simplicity. We foster a supportive, growth-focused culture where we mentor each other and work together to build resilient, high-quality systems.

Requirements

  • 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or Systems Administrator focused on physical infrastructure in a large-scale distributed environment
  • Strong software development skills in a language like Swift, Go, or Python
  • High degree of comfort with shell scripting (Bash)
  • Hands-on experience building and managing systems with containers and container orchestration tools (Docker, Kubernetes)
  • Deep understanding of networking (TCP/IP, DNS, HTTP)
  • Experience using observability tools (monitoring, logging, tracing) to diagnose complex issues
  • Excellent problem-solving and communication skills, with a strong sense of ownership and drive
  • BS/MS in Computer Science, Engineering, or a related field

Nice To Haves

  • Experience with Unix/Linux systems administration and command-line diagnostic tools
  • Proven experience leading initiatives to reduce technical debt, refactor systems, or improve performance and latency
  • Expertise in performance analysis and capacity planning for physical infrastructure
  • Demonstrated ability to lead incident response for high-impact outages
  • Familiarity with using Generative AI (GenAI) or Large Language Models (LLMs) to accelerate operational tasks, such as automating runbooks, generating scripts, or analyzing incident data

Responsibilities

  • Build and manage world-class services and physical infrastructure that Apple software engineers worldwide use to build, test, and release Apple's software
  • Work with teammates to build resilient, high-quality systems
  • Build automation tools that eliminate routine tasks
  • Lead initiatives to reduce technical debt, refactor systems, or improve performance and latency
  • Conduct performance analysis and capacity planning for physical infrastructure
  • Lead incident response for high-impact outages