Senior Manager, Hybrid Services & Reliability (SRE)

General MotorsAustin, TX
1dHybrid

About The Position

The Role: As the Senior Engineering Manager for Hybrid Services & Reliability (HSR) within AV Core Infrastructure (ACI) at GM, you are the architect of our system trust. You will lead a newly seeded team responsible for the measurable availability of the hybrid cloud systems that underlie all autonomous vehicle development and operations. We need a leader who views reliability not as an afterthought, but as an inherent property of the platform, ensuring that all teams have a stable and ready-state engineering environment. You are comfortable operating systems at scale, not just designing them. What you’ll do Reliability Engineering: Define, measure, and enforce strict SLOs/SLIs for critical hybrid cloud services, including network connectivity and compute readiness. Foundational Utilities: Own and manage core on-prem utilities, such as DHCP, PXE, and CDN, to ensure seamless server auto-provisioning across the global fleet. Environment Integrity: Manage the entire data flow path, from initial ingestion at the test bench through the secure cloud network into production staging. HIL Readiness: Guarantee the 99%+ availability and stability of remote CI-based Hardware-in-the-Loop (HIL) benches required for AV safety validation. Organization Growth: Actively lead the recruitment and technical mentorship of Senior and Staff ICs as part of the team's expansion.

Requirements

  • Extensive background in Site Reliability Engineering (SRE) and defining SLO/SLI frameworks for hybrid cloud environments.
  • Technical proficiency in managing on-prem Linux utilities (DHCP/PXE/NTP) and core development services.
  • Opinionated view on automated observability, incident response, and MTTR reduction.
  • Proven leadership experience.

Nice To Haves

  • Experience with configuration management tools (e.g., Chef, Ansible) for large-scale, remote hardware fleets.

Responsibilities

  • Define, measure, and enforce strict SLOs/SLIs for critical hybrid cloud services, including network connectivity and compute readiness.
  • Own and manage core on-prem utilities, such as DHCP, PXE, and CDN, to ensure seamless server auto-provisioning across the global fleet.
  • Manage the entire data flow path, from initial ingestion at the test bench through the secure cloud network into production staging.
  • Guarantee the 99%+ availability and stability of remote CI-based Hardware-in-the-Loop (HIL) benches required for AV safety validation.
  • Actively lead the recruitment and technical mentorship of Senior and Staff ICs as part of the team's expansion.

Benefits

  • GM offers a variety of health and wellbeing benefit programs.
  • Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.
  • This job may be eligible for relocation benefits.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service