About The Position

The Cluster Operations Manager is responsible for one or more Amazon Data Center Clusters and Colocation Operations within a particular region. It is the senior Infrastructure Operations role within the region and has managerial responsibility for safety, security, availability, scaling, efficiency and cost. The Infrastructure Operations organizations are composed of two primary functions: Data Center operations (DCO) and Data Center Engineering Operations (DCEO). A physical security organization, while not reporting directly to the Cluster Operations Manager, is an integral part of the operation. Data Center Operations focuses on the server-level platforms that support both Amazon Retail and Amazon Web Services. Engineering Operations focuses on the mechanical, electrical and controls systems that support our data Center critical environments. Security Operations are charged with the physical security of our people, assets, and customer data. The Cluster Operations Manager must be able to build and lead high performing teams across each of these functions, understand and manage their daily operations while at the same time having the technical capability and curiosity to dive deep into any given challenges as needed. The Cluster Operations Manager is a key role in the management team that is operating and scaling the world’s largest cloud computing infrastructure. We encounter interesting, challenging and complex problems every day. As a technical manager in Amazon you can innovate to solve these issues and help drive operations excellence in all areas of your role. You will have the ability to refine and develop processes to optimize operational excellence in every aspect of your role. You must also have a passion for technology along with a desire to achieve best-in-the-world operational performance. AWS Infrastructure Services (AIS) owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help. You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.

Requirements

  • Bachelor's degree or above in computer science, computer engineering, or related field, or experience managing teams
  • 5+ years of management experience
  • 5+ years of operations management experience
  • Experience with Lean/Six Sigma
  • 4+ years of professional work experience, or experience in technical work related to computer systems and technology components

Nice To Haves

  • Knowledge of Lean Manufacturing & Continuous Improvement principles & techniques
  • Experience in identifying security issues and risks, and developing mitigation plans
  • 7+ years of team management experience
  • 10+ years of design, construction or program management in mission critical facilities experience
  • Experience communicating technical details verbally and in writing

Responsibilities

  • Hiring, managing and developing the operations team including compute operations managers, engineering operations managers, logistics operations managers and their teams
  • Attainment of organizational performance goals and objectives relating to safety, security, availability, scaling, efficiency and cost
  • Planning and executing the Infrastructure Operations component of new Data Centers and Colocation (Colo) expansions
  • Safety, security, and availability incident response, incident management and incident resolution
  • Continuous improvement of operational processes, procedures, methods and tools

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service