This role focuses on operating and supporting OpenShift Virtualization (KubeVirt) platforms that host critical virtual machine workloads. The engineer will be responsible for day-2 platform operations, including upgrades, patching, capacity management, and incident response. They will also solve complex production issues across OpenShift Virtualization/KubeVirt, KVM and Linux virtualization subsystems, OpenShift control plane and node components, and networking and storage integrations. The position involves performing VM lifecycle operations such as provisioning, live migration, snapshots, backups, and recovery. A key aspect of the role is to improve platform stability and reliability through automation, monitoring, and operational tooling, utilizing Ansible, CI/CD pipelines, and infrastructure-as-code practices. The engineer will lead root-cause analysis, drive corrective and preventative actions, and partner with SRE, networking, storage, and security teams. Creating and maintaining runbooks, operational documentation, and on-call playbooks, as well as mentoring other engineers, are also important responsibilities.
Job Type: Full-time
Career Level: Senior
Education Level: No Education Listed