As a Site Reliability Engineer, you will play a pivotal role in ensuring the reliability, availability and performance of our cloud infrastructure and operating systems in mission critical client solutions around the globe. To do this you will design, manage and execute the upgrade and maintenance schedule for a defined list of clients. You will work ongoingly to automate infrastructure processes, implement best practices and introduce new approaches and tools that enhance our software delivery pipeline and reliability and performance of live client solutions. Success is working proactively to predict client needs, increase efficiencies and ultimately increase customer satisfaction and reduce the number and severity of support incidents. This means exceeding our SLAs and SLOs. You will produce upgrade and maintenance plans for all clients under your responsibility, and work with your team and client contacts to deliver to the plan on time. You will implement and review infrastructure monitoring and observability tools, identifying planning and delivering initiatives that deliver business and client value and reduce risk. The Orion Health Tech Ops group exists to exceed client expectations in the maintenance and improvement of their Orion Health solutions. The Operations division succeeds together, so strong collaboration will be required with all other roles across the Tech Ops and Service Mgmt groups. Wider internal key relationships will be developed with Product, Delivery and Solutions teams.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
Associate degree