The National Center for Computational Sciences (NCCS) at Oak Ridge National Laboratory (ORNL), which hosts several of the world's most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the NCCS computing infrastructure. This infrastructure supports multiple highly ranked Top500 supercomputers, including Frontier, the world's first exaflop system.

As a Kubernetes Engineer in the Platforms group, you will support all activities of our supercomputer center. Our primary platform is the OLCF Slate Service, built on Kubernetes and Red Hat OpenShift. It provides a container orchestration service for critical operational applications and user-managed persistent applications that run alongside our OLCF supercomputer systems and other OLCF-managed HPC clusters.

As a Platform Engineer, you will operate, implement, and maintain the infrastructure underpinning our on-premises Kubernetes clusters, with a strong focus on scalability, reliability, and maintainability. You will assist with our platform engineering initiatives, evaluate and integrate key technologies, serve as an individual contributor on large and medium-sized projects, and help a team of engineers deliver a robust internal platform that powers development across the organization.

This role requires Kubernetes experience, Kubernetes cluster administration experience, Linux system administration experience, knowledge of data center hardware, and significant experience with scripting and automation.
Job Type: Full-time
Career Level: Mid Level
Industry: Professional, Scientific, and Technical Services
Number of Employees: 5,001-10,000 employees