Senior Engineer, Hybrid Services & Reliability (SRE)

General Motors
Austin, TX (Hybrid)

About The Position

We are hiring a Senior Engineer to join the Hybrid Services & Reliability (HSR) team as a foundational member. At GM, AV Core Infrastructure (ACI) is the engine room responsible for the unified, secure hybrid platform that powers all autonomous vehicle hosting, validation, and testing. You will build and maintain the "system trust" required for Super Cruise validation, where 95%+ readiness of our "bench cloud" environment is a Tier-0 requirement. We are looking for an engineer who is uncomfortable with manual toil and is driven to build systems where scaling and recovery are inherent properties.

Requirements

  • Proven professional experience in Site Reliability Engineering (SRE) or DevOps, ideally within a hybrid cloud environment.
  • Strong proficiency in Linux systems administration and the management of core networking services (DHCP/PXE).
  • Hands-on experience with Infrastructure as Code (IaC) and configuration management tools (e.g., Chef, Ansible, or Terraform).
  • Ability to break down broad technical challenges into clear, implementation-ready initiatives with minimal supervision.
  • A "Growth-based Mindset" with a commitment to continuous learning and upskilling in a high-velocity environment.

Nice To Haves

  • Experience with Kubernetes (k8s) and monitoring high-throughput data pipelines.

Responsibilities

  • Reliability Execution: Implement and manage Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for critical hybrid services, ensuring the platform meets rigorous uptime and readiness targets.
  • Service Automation: Drive the automation of foundational on-prem utilities (including DHCP, PXE, and NTP) to ensure the fleet of remote CI-based hardware benches is always provisioned and in a ready state.
  • Observability & Response: Build and optimize observability stacks (dashboards and alerting) to detect system degradation before it impacts developers, focusing on reducing Mean Time to Recovery (MTTR).
  • Environment Stability: Own the integrity of data ingestion paths from physical test benches through the secure cloud network, ensuring dependencies are stable and performant.
  • Operational Excellence: Identify and eliminate "human duct tape" by replacing manual, repetitive tasks with robust automation primitives and self-service tools.
  • Mentorship: Provide technical guidance and peer reviews for other engineers, fostering a culture of high-quality code and resilient architecture.

Benefits

  • GM offers a variety of health and wellbeing benefit programs.
  • Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.