Manager, Infrastructure Engineering

WorkdayPleasanton, CA
Hybrid

About The Position

The Infrastructure Site Reliability Engineering (SRE) team is the backbone of our global infrastructure. We are a collaborative group of engineers dedicated to driving automation, scalability, security, and resilience across the organization. By partnering closely with our Platform, Operations, and Security teams, we ensure that Workday’s services remain highly available, performant, and secure. We are committed to a culture of continuous improvement, leveraging innovative technologies to provide a seamless and superior experience for our customers. In this role, you will join our dynamic, distributed team and embark on a journey of professional growth. We share a collective responsibility for maintaining the high availability of Workday's production and development environments. You will be instrumental in developing innovative software solutions to build, scale, monitor, and refine our global infrastructure. You will empower our engineers to drive operational excellence by prioritizing automation and eliminating manual toil. By collaborating closely with internal stakeholders and customers to resolve incidents promptly, you will make a tangible impact on our world-class service delivery.

Requirements

  • 3+ years experience leading a 24x7 production environment, across multiple data centers
  • 7+ years of IT experience
  • 3+ years in a management role
  • 4+ years experience working with large IP based network
  • Proven ability to manage a team of high performers and highly motivated network and system engineers
  • Deep passion for leading, mentoring, and growing talent, with the interpersonal skills necessary to guide daily activities and build cross-organizational relationships
  • Strong focus on empowering engineers to drive operational excellence by prioritizing automation and eliminating manual toil
  • Proven knowledge of Linux Systems with previous hands-on experience in production environments
  • Experience in building, scaling, and monitoring global infrastructure across multiple data centers to maintain high availability
  • Deep knowledge of IP networking (BGP, load balancers, switching)
  • Experience supporting core internet services such as Mail, CDNs, and DNS
  • Ability to lead initiatives in infrastructure observability and process standardization to ensure world-class service delivery

Responsibilities

  • Maintain high availability of Workday's production and development environments
  • Develop innovative software solutions to build, scale, monitor, and refine global infrastructure
  • Empower engineers to drive operational excellence by prioritizing automation and eliminating manual toil
  • Collaborate closely with internal stakeholders and customers to resolve incidents promptly
  • Manage a team of high performers and highly motivated network and system engineers
  • Work with Site Reliability Engineering teams in Europe and APAC to provide 24x7 operations support for critical Workday services
  • Lead key goals in the areas of automation, infrastructure observability, and process standardization in Workday data centers
  • Partner with Customer and Application support teams to solve and remediate problems

Benefits

  • Workday Bonus Plan or a role-specific commission/bonus
  • Annual refresh stock grants
  • Comprehensive benefits
  • Flexible schedule that caters to your business, team, and personal needs
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service