We are seeking a skilled AI DevOps System Administrator to build, manage, and optimize the infrastructure supporting our Artificial Intelligence and Machine Learning initiatives in a classified environment. The ideal candidate will be responsible for maintaining the CI/CD pipeline for ML models, managing GPU resources, and ensuring the stability, scalability, and security of the AI development and deployment environment. This role requires close collaboration with data scientists and ML engineers to streamline workflows from model development to production. As a seasoned leader, you’ll be involved with our client's decision-making process by serving as a front-line interface to users with technical issues and conducting systems analysis and development to keep systems current with changing technologies. Your duties may include installing new software, troubleshooting, granting permissions to applications and training users. You’ll also be responsible for the day-to-day support of server services by performing server administration for physical and virtual server operating systems and configuring, maintaining and troubleshooting of physical and virtual hardware and network related interfaces on servers. We’ll rely on you to perform, maintain, troubleshoot and conduct a complete analysis of alerts; create scripts to automate repetitive processes; and work with customers to identify, isolate, and resolve problems with servers that are affecting other services.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level