About The Position

Amazon Web Services (AWS) is seeking a Data Center Operations (DCO) Cluster Manager to join the AWS Infrastructure Services organization and serve as a technical resource and leader within data centers. We are currently building out our infrastructure management team for an up and coming region and are looking for proven technology managers with experience in people management, strong technical understanding, and the drive to take AWS DCO to the next level. The DCO Cluster Manager is the senior leadership role for our compute operations teams within an AWS region that operates 24/7. You will have managerial responsibility for safety, security, availability, scaling, costs and efficiency for your department. You lead the team that is installing, maintaining, and decommissioning network and server equipment in a safe, secure, and cost-effective manner across the region. The DCO Cluster Manager must manage across each function but also have the ability to dive deep into any given function as needed. The DCO Cluster Manager must be physically collocated near the region they are responsible for and able to respond to any high-severity event and be on site within an hour. The successful candidate will be a highly driven, self-managed individual who demonstrates initiative and proactively seeks solutions to problems. They will have a strong track record of developing talent and managing the performance of their direct reports and organization; including being able to support a high cadence of developing people into new roles outside of the organization. Ideally, they have worked with ticketing systems and been involved in responses to high-severity operational events. In addition to strong knowledge in data centers and a broad technical understanding of how networks and cloud architecture works, the candidate will create documentation, drive continuous improvement, participate in Inclusion and Diversity initiatives, and fix complex problems with simple solutions across multiple AWS regions. While not required, an understanding of critical electrical & HVAC systems will enhance a candidate’s ability to be successful. This team works in an environment that operates 24/7. This position requires that the candidate selected be a US citizen and currently possess and maintain an active Top Secret security clearance with SCI eligibility. The position further requires that, after start, the selected candidate obtain and maintain an active TS/SCI security clearance with polygraph and satisfy other security related requirements.

Requirements

  • 4+ years of management experience
  • Knowledge of information technology infrastructure domains such as compute server platforms, storage server platforms, server components, network devices, technologies and architectures, IT service delivery principles and best practices
  • Experience hiring, developing, and managing high-performing technical teams
  • Current, active US Government Security Clearance of Top Secret with SCI eligibility or above

Nice To Haves

  • Knowledge of building codes and regulations including Life Safety, BOCA, NFPA, NEC, and OSHA
  • Experience owning the operation of a mission-critical team or product
  • Experience with large-scale technical operations or large-scale compute farms
  • Experience with process improvement techniques such as Kaizen, Lean Manufacturing or Six Sigma

Responsibilities

  • Hiring, managing, and developing the operations management team including DCO site managers, Decom managers, DCO technicians, and Decom technicians.
  • Oversee the safety, security, availability, quality, and performance of the team, while driving a positive customer experience across a 24/7 shift schedule.
  • Prioritize projects assigned to DCO teams and sites.
  • Routinely review ticket queue for large events and address problems accordingly.
  • Coordinate change management resources.
  • Guide, train, and educate data center staff on the best practices related to all service owner issues.
  • Manage front line managers. This includes mentoring, training, and developing career progression for both direct reports and members of the organization.

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service