About The Position

As a Computer Specialist I, II, III, or Senior, you will play a crucial role in managing and optimizing the high-performance computing infrastructure and supporting network at the Mississippi State University (MSU) High Performance Computing Collaboratory (HPC²), which serve complex scientific, engineering, and research applications.

Must be a U.S. Citizen or Permanent Resident.

Salary Grade: 15. Please see the Staff Compensation Structure or the Skilled Crafts and Service Maintenance Compensation Schedule for salary ranges. For salary grade UC, positions are "Unclassified" and salary ranges are determined by the hiring department.

Department Profile: The High Performance Computing Collaboratory (HPC²) at Mississippi State University, an evolution of the MSU NSF Engineering Research Center (ERC) for Computational Field Simulation, is a coalition of member institutes and centers that share a common core objective of advancing the state of the art in computational science and engineering using high performance computing; a common approach to research that embraces a multidisciplinary, team-oriented concept; and a commitment to a full partnership between education, research, and service. The mission of the HPC² is to serve the University, State, and Nation through excellence in computational science and engineering.

Area of Specialization: Linux Operating Systems

Requirements

  • Level I: Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Level II: Bachelor’s degree in Computer Science, Engineering, or a related field plus 4 years of experience directly related to the duties and responsibilities specified.
  • Level III: Bachelor’s degree in Computer Science, Engineering, or a related field plus 8 years of experience directly related to the duties and responsibilities specified.
  • Senior: Bachelor’s degree in Computer Science, Engineering, or a related field plus 10 years of experience directly related to the duties and responsibilities specified.
  • Promotions are based on years of service as well as knowledge, skills, abilities, and performance.
  • Substitution allowed: Any equivalent combination of education and experience will be considered with HR approval.
  • Strong written and oral communication skills.
  • Demonstrated ability to understand and document software business rules and requirements using diagrams, written use cases, and user stories.
  • Strong analytical and technical skills with attention to detail and accuracy.
  • Strong organizational skills and the ability to self-direct efforts to complete planning and documentation requirements.
  • Strong problem-solving skills.
  • Self-motivated and directed, with the ability to work both independently and in a team-oriented, collaborative environment.

Nice To Haves

  • Proficient with UNIX-like operating systems (UNIX/Linux), including experience as a system administrator for virtual and physical environments and HPC clusters.
  • Working knowledge of networking technologies and services such as TCP/IP, DNS, DHCP, PXE, VLANs, RADIUS, proxies, etc.
  • Familiar with HPC networking architectures such as InfiniBand and Ethernet with RoCE.
  • Experience with HPC job scheduling technologies, such as Slurm, PBS Pro, and Moab.
  • Experience with parallel file systems such as Lustre and other storage technologies.
  • Experience with MPI or parallel and distributed computing libraries and codes.
  • Experience with HPC node provisioning and automation tools such as Puppet, Warewulf, Ansible, etc.
  • Experience managing SSL certificates.
  • Experience with virtualization platforms and tools such as KVM, Proxmox, Podman, Apptainer, or similar technologies.
  • Experience with authentication infrastructures such as LDAP, Kerberos, or FreeIPA.
  • Understanding and experience with cybersecurity principles, tools, and practices, including knowledge of NIST 800-53, NIST 800-171, or CMMC Security Controls.
  • Relevant certifications in information technology or cybersecurity.
  • Proficiency in one or more programming/scripting languages such as C, C++, Java, Perl, Python, PHP, or Shell scripting.

Responsibilities

  • Install, configure, and manage Linux operating systems on physical and virtual servers, desktops, and HPC clusters.
  • Plan and implement system upgrades, migrations, life cycle management, and change management processes.
  • Create, manage, and support virtual machines (VMs) and containers based on business needs.
  • Monitor and optimize system performance, including resource allocation and load balancing, and recommend hardware/software upgrades.
  • Perform monitoring and troubleshooting of system hardware, software, and operating systems to maintain systems and preserve data integrity.
  • Provide user and system support, education, and training, including coordination with computing staff for integrating services such as file systems, printing, and other system resources.
  • Implement and maintain security best practices across the computing environments.
  • Develop and maintain the software infrastructure for the HPC environment, including user interfaces, utility scripts, and software stacks.
  • Develop and maintain automated processes for system management.
  • Maintain comprehensive documentation of system configurations, processes, and procedures.
  • Stay current with industry trends and technologies related to systems administration, virtualization, and HPC technologies.
  • Manage, train, and provide operational oversight to junior-level system administrators; collaborate with cross-functional IT teams to ensure seamless integration of systems and services.
  • Perform duties as assigned in a responsible manner.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Entry Level
  • Number of Employees: 1,001-5,000 employees
