Build and support a reliable application suite for the environment to meet the development and maintenance requirements of systems/platforms.
Implement Service Reliability Engineering by working as part of the development team to evaluate the health, stability, and reliability of applications.
Lead the team in best practices for incident, problem, and change management.
Use monitoring, alerts, dashboards, and management tools to ensure the availability, reliability, cost efficiency, and performance of applications and services.
Continuously improve and implement automation of application tasks.
Provide technical support for systems/platforms according to application SLAs.
Design and develop resiliency in the application code, troubleshoot incidents, engage with squads to address failure patterns, and participate in incident management.
Develop delivery pipelines and automated deployment scripts.
Configure services, such as databases and monitoring.
Job Type
Full-time
Career Level
Mid Level
Industry
Professional, Scientific, and Technical Services
Education Level
No Education Listed
Number of Employees
5,001-10,000 employees