Site Reliability Managers at UKG have a breadth of knowledge encompassing all aspects of service delivery and management. This SRE role is primarily responsible for application reliability, performance, and operability as software runs on the underlying platform. The team focuses on how applications behave in production (scalability, stability, resource usage, and failure recovery) rather than on feature development. Site Reliability Managers lead and grow teams that develop solutions to increase resiliency and support our Cloud Engineering and Infrastructure. This work can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and automation. They are passionate about learning and evolving with current technology trends and enabling their teams to do the same. They strive to innovate and are relentless in pursuing a flawless customer experience. They have an "automate everything" mindset and lead their teams to bring value to our customers by deploying services with incredible speed, consistency, and availability.
Job Type
Full-time
Career Level
Manager