Senior Site Reliability Engineer

Motorola SolutionsPhoenix, AZ
28dRemote

About The Position

We are seeking a senior Site Reliability Engineer for our DevSecOps team. The DevSecOps team is a high-impact group of 8 engineers who are responsible for supporting the national operations of our VI deployments including our VehicleManager portfolio. These are highly available 24x7 cloud applications. Your SRE role encompasses deployment architecture, continuous software delivery, observability, high availability, disaster tolerance, survivability, testing, continuous improvement, all with automation. You'll work with a dynamic and energetic team to support and mentor you. This role is central to defining and implementing our next-generation observability strategy. The ideal candidate will be a hands-on expert in the full observability stack, with deep experience in leveraging metrics, logs, and distributed traces to proactively ensure optimal end-user experience and system health. The candidate should have experience with our platform's technology stack, including Kubernetes with .NET containers. Success in the first year means the candidate contributes to availability and SRE rigor of our Vehicle Intelligence applications. Candidates must demonstrate an advanced ability to diagnose complex, multi-service production incidents using APM tools, translating deep technical analysis into actionable insights for engineering and product teams.

Requirements

  • Must be a US Citizen due to security compliance for this role; working visa not accepted
  • Bachelor's of Science degree in Computer Science or related applied technology field
  • 3 years experience with cloud operations and Site Reliability Engineering
  • Deep understanding of Observability and Application Performance Management
  • Deep understanding of cloud architecture, microservices, pipelines/workflows, and Ansible
  • Hands-on experience with Microsoft .NET, C#
  • Expertise with cloud software delivery pipelines, including GitHub
  • Ability to independently and collaboratively solve problems in a dynamic, fast-paced environment
  • Strong communication skills, capable of conveying technical concepts to a diverse audience
  • Bachelors degree with 3+ years of cloud operations and site reliability engineering
  • Must be a U.S. citizen with the ability to obtain necessary security clearance as required by government contract.

Responsibilities

  • Analysis of the end user experience, system performance, availability, uptime
  • Drive the creation, deployment, and optimization of our core Observability platforms (e.g., Prometheus, Grafana, etc.) to ensure complete system visibility
  • Monitoring and troubleshooting the system
  • Creating workflow automation
  • Managing and creating levels of automated test

Benefits

  • Incentive Bonus Plans
  • Medical, Dental, Vision benefits
  • 401K with Company Match
  • 10 Paid Holidays
  • Generous Paid Time Off Packages
  • Employee Stock Purchase Plan
  • Paid Parental & Family Leave
  • and more!

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Computer and Electronic Product Manufacturing

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service