Why GMF Technology?
At GM Financial, innovation drives everything we do. We’re not just adopting technology; we’re shaping the future of software delivery. From generative AI and cloud-native platforms to advanced release engineering practices, our teams are redefining how financial technology operates. This role is central to that transformation, influencing how we build, release, and scale software globally. Join us and discover a workplace where your ideas matter, your development is prioritized, and you can truly make a global impact.

About The Role:
The Site Reliability Engineer, under general direction from leadership, will assist in the day-to-day tasks critical to the team's success. The position is responsible for supporting cloud infrastructure architecture and components, including hybrid cloud and public cloud platforms. This includes prototyping, initiating, and operationalizing public cloud solutions. The role also supports overall Cloud Transformation initiatives designed to meet key goals in creating a service-driven culture through the performance and delivery of SaaS, PaaS, and IaaS solutions from public cloud vendors such as Azure and AWS. The Site Reliability Engineer will be responsible for the configuration, efficiency, and performance of deployed public cloud solutions. The scope of the role includes not only cloud engineering but also advanced automation capabilities and some overlap with software development disciplines.

Build and demonstrate a foundational understanding of SRE concepts, including observability, monitoring, incident response, and the core systems owned by the team.
Execute standard operational tasks independently using established processes, runbooks, tooling, and escalation paths; raise issues when scenarios become complex or unfamiliar.
Perform initial troubleshooting for clear production or environment issues with limited guidance; contribute findings and next steps to the broader resolution effort.
Demonstrate ownership of learning by seeking mentorship, asking questions, and contributing back to shared team knowledge.
Help teams apply SRE operational readiness practices using the SRE Checklist, with emphasis on detection/observability, performance, resiliency, automation, and operational readiness before go-live.
Assist with defining and implementing basic monitoring coverage aligned to the Golden Signals (latency, traffic, errors, and saturation/capacity), and validate that telemetry appears correctly in monitoring platforms.
Follow established standards for cloud-based resources in the Azure environment for automation and troubleshooting.
Support logging and exception-handling hygiene by aligning to known standards (e.g., ensuring correlation IDs and key dimensions are captured where required).
Assist with and provide systems administration setup and configuration as needed for supported services and environments.
Contribute to toil reduction by helping implement and maintain repeatable operational mechanisms (e.g., health checks/probes and monitoring configuration) as defined in standards and patterns.
Job Type: Full-time
Career Level: Entry Level
Education Level: Associate degree