We are seeking a senior Site Reliability Engineer for our DevSecOps team. The DevSecOps team is a high-impact group of 8 engineers who are responsible for supporting the national operations of our VI deployments including our VehicleManager portfolio. These are highly available 24x7 cloud applications. Your SRE role encompasses deployment architecture, continuous software delivery, observability, high availability, disaster tolerance, survivability, testing, continuous improvement, all with automation. You'll work with a dynamic and energetic team to support and mentor you. This role is central to defining and implementing our next-generation observability strategy. The ideal candidate will be a hands-on expert in the full observability stack, with deep experience in leveraging metrics, logs, and distributed traces to proactively ensure optimal end-user experience and system health. The candidate should have experience with our platform's technology stack, including Kubernetes with .NET containers. Success in the first year means the candidate contributes to availability and SRE rigor of our Vehicle Intelligence applications. Candidates must demonstrate an advanced ability to diagnose complex, multi-service production incidents using APM tools, translating deep technical analysis into actionable insights for engineering and product teams.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Industry
Computer and Electronic Product Manufacturing
Number of Employees
5,001-10,000 employees