Senior Site Reliability Engineer

Motorola Solutions•Phoenix, AZ

77d•Remote

About The Position

We are seeking a senior Site Reliability Engineer for our DevSecOps team. The DevSecOps team is a high-impact group of 8 engineers who are responsible for supporting the national operations of our VI deployments including our VehicleManager portfolio. These are highly available 24x7 cloud applications. Your SRE role encompasses deployment architecture, continuous software delivery, observability, high availability, disaster tolerance, survivability, testing, continuous improvement, all with automation. You'll work with a dynamic and energetic team to support and mentor you. This role is central to defining and implementing our next-generation observability strategy. The ideal candidate will be a hands-on expert in the full observability stack, with deep experience in leveraging metrics, logs, and distributed traces to proactively ensure optimal end-user experience and system health. The candidate should have experience with our platform's technology stack, including Kubernetes with .NET containers. Success in the first year means the candidate contributes to availability and SRE rigor of our Vehicle Intelligence applications. Candidates must demonstrate an advanced ability to diagnose complex, multi-service production incidents using APM tools, translating deep technical analysis into actionable insights for engineering and product teams.

Requirements

Must be a US Citizen due to security compliance for this role; working visa not accepted
Bachelor's of Science degree in Computer Science or related applied technology field
3 years experience with cloud operations and Site Reliability Engineering
Deep understanding of Observability and Application Performance Management
Deep understanding of cloud architecture, microservices, pipelines/workflows, and Ansible
Hands-on experience with Microsoft .NET, C#
Expertise with cloud software delivery pipelines, including GitHub
Ability to independently and collaboratively solve problems in a dynamic, fast-paced environment
Strong communication skills, capable of conveying technical concepts to a diverse audience
Bachelors degree with 3+ years of cloud operations and site reliability engineering
Must be a U.S. citizen with the ability to obtain necessary security clearance as required by government contract.

Responsibilities

Analysis of the end user experience, system performance, availability, uptime
Drive the creation, deployment, and optimization of our core Observability platforms (e.g., Prometheus, Grafana, etc.) to ensure complete system visibility
Monitoring and troubleshooting the system
Creating workflow automation
Managing and creating levels of automated test