The Lead Systems Administration role involves performing the deployment of software updates on a cloud platform consisting of Linux OS, BIOS/firmware, OpenStack, Kubernetes, Calico, Ceph, Maria DB, and other software components. The position ensures continual service availability, troubleshoots and resolves production problems, and provides incident response, management, and root cause analysis. Responsibilities include proactive development and implementation of monitoring systems, maintaining and improving tools, scripting, and automation infrastructure for configuration management, maintenance, testing, auditing, problem remediation, and capacity planning. The role also involves collaborating with and managing vendors to drive defect resolution and enhancements, conducting routine hardware and software audits, performing setup, maintenance, and monitoring of backups, developing standard operating procedures, performing security compliance and remediation, and conducting Operational Readiness Testing and Operational Acceptance Testing. Additionally, the role provides integration and advisement to tenants, supports systems before launch through collaboration with various teams, and performs feasibility assessments, requirements creation, project management, and technical solution integration and testing.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Number of Employees
5,001-10,000 employees