Lawrence Berkeley National Laboratory-posted 3 months ago
$153,492 - $187,596/Yr
Full-time • Mid Level
Berkeley, CA
1-10 employees

The National Energy Research Scientific Computing Center (NERSC) is seeking a versatile Linux System / Platform Engineer to join our team building and managing Linux-based infrastructure. More than ever, scientific discovery transforms our world. NERSC is at the forefront, operating some of the world’s largest supercomputers for thousands of researchers who use computational power to solve society’s most challenging problems. In this exciting role, you will help build and manage our container and virtual machine platforms and use them to deploy systems that keep our supercomputing center running smoothly and help researchers make the most of its resources, including API endpoints, scientific research tools, authentication, identity and access management, databases, and more. You’ll join a group of systems and software engineers and will routinely work with other groups across NERSC on a variety of projects. You’ll also collaborate with our counterparts at peer scientific facilities, also operated by the Department of Energy Office of Science, to streamline cutting-edge research using automation and cloud-native and AI tools and techniques.

  • Work with a team to build and manage Linux systems and storage infrastructure.
  • Troubleshoot and solve complex technical problems with other team members.
  • Install, upgrade, and secure equipment and services.
  • Develop and refactor scripts and other code.
  • Participate in 24x7 on-call rotation.
  • Coordinate small project teams or other initiatives (such as the rollout of a new service or system, or a major equipment or software upgrade).
  • Work with vendors to prioritize efforts and enhance their technologies to meet user needs.
  • Work with researchers to deploy services using Spin, our container cloud platform based on Kubernetes.
  • Collaborate within NERSC and across the DOE community to develop services, integrate them into the new NERSC supercomputer Doudna, the NERSC data center environment, and across multiple DOE facilities.
  • Present developments to NERSC staff and the broader HPC community at science conferences and industry meetings.
  • Typically, 8+ years of related experience with a Bachelor’s degree; alternatively, 6+ years with a Master’s degree; or equivalent career experience.
  • 4+ years of experience managing large-scale Linux-based system deployments in a high-performance computing, cloud computing, or hyper-scale environment.
  • Experience with some or all of our key technologies: containers (such as Docker or Kubernetes), virtualization (such as Proxmox or VMware), cloud-based deployment (such as AWS, Azure or GCP), using and developing AI (or machine learning) tools and services, identity and access management, database administration, tuning, and troubleshooting, networked storage systems, backup technologies.
  • Familiarity with automated provisioning systems (such as Chef, Foreman, or Terraform).
  • Familiarity with configuration management systems (such as Ansible or Puppet).
  • Working knowledge of Linux system engineering and security practices.
  • Ability to resolve complex issues in creative and effective ways and derive technical solutions in a collaborative environment to meet end user requirements or needs.
  • Demonstrated ability to work independently as well as collaboratively in large projects, and contribute to an active and respectful intellectual environment.
  • Creative, positive, and collaborative work style.
  • Excellent oral and written communication skills.
  • Typically, 12+ years of related experience with a Bachelor’s degree; alternatively, 8+ years with a Master’s degree; or equivalent career experience.
  • Experience in software engineering or complex scripting.
  • Experience managing network equipment.
  • Ability to lead and coordinate projects.
  • Ability to analyze and resolve significant and unique issues requiring evaluation of multiple intangible factors.
  • Ability to exercise independent judgment in methods, techniques and evaluation criteria for obtaining results.
  • Full-time, career appointment, exempt (monthly paid) from overtime pay.
  • Flexible work mode, and hybrid schedules may be considered.
  • Salary range for Level 3: $136,440 to $230,244 per year; targeted range $153,492 to $187,596.
  • Salary range for Level 4: $155,388 to $262,224 per year; targeted range $174,804 to $213,660.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service