HPC/AI Data Performance Engineer

Lawrence Berkeley National LaboratoryBerkeley, CA
113d$156,864 - $191,724

About The Position

In this exciting role, you will serve as a Data Performance Engineer in NERSC’s Application Performance Group, architecting HPC and AI data services that advance fundamental science. You’ll optimize storage systems for Doudna, NERSC’s next supercomputer, and develop I/O and data management solutions for the American Science Cloud. Working with lab, academic, and industry partners, you will help design and deploy advanced storage and data management solutions, support I/O needs for HPC and AI applications, and collaborate with scientists to optimize data workflows for their research. The selected candidate(s) will be hired at the Computer Systems Engineer 3 or 4 (CSE3 or CSE4) depending on your level skills and experience.

Requirements

  • Bachelor’s degree in Physical Sciences, Computer Science or related field or equivalent is required. Masters and PhD degrees in similar disciplines are preferred.
  • Typically requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or equivalent experience.
  • Wide-ranging experience in the areas of data management, storage and I/O as applied to scientific data.
  • Ability to troubleshoot and resolve complex issues in creative and effective ways.
  • Ability to network and collaborate with key contacts outside their own area of expertise.
  • Excellent oral and written communication skills.
  • Excellent software development skills.
  • Proven ability to work productively both independently and as part of an interdisciplinary team balancing divergent objectives involving research, code development, supporting software and consulting with scientists.

Nice To Haves

  • Familiarity with HPC/AI storage architectures and technologies.
  • Experience with I/O optimization of scientific and/or AI workloads.
  • A proven track record of software development in computing, AI, or domain sciences.
  • Familiarity with computing hardware, storage systems, and data management systems.
  • Ability to work on and resolve significant and unique issues where analysis of situations or data requires an evaluation of intangibles.

Responsibilities

  • Develop AI storage and I/O services on NERSC’s advanced computing and data systems to support fundamental science.
  • Support storage and I/O software on NERSC supercomputers, deploy new cutting-edge tools and frameworks for scalable scientific workflows.
  • Provide expert I/O engineering engagement and training events to scientists and users of NERSC computing resources.
  • Engage with the AI and HPC storage community to stay on top of the latest advancements in services and software.
  • Shape future NERSC supercomputers, evaluating new storage systems for AI and HPC.
  • Collaborate with scientists and industry partners to enable transformative science.
  • Determine methods and procedures on new assignments and may coordinate activities of other personnel.
  • Network with key contacts outside their own area of expertise.
  • Work on and resolve complex issues where analysis of situations or data requires an in-depth evaluation of variable factors.
  • Exercise judgment in selecting methods, techniques, and evaluation criteria for obtaining results.

Benefits

  • Exceptional health and retirement benefits, including pension or 401K-style plans.
  • Opportunities to grow in your career: Tuition assistance program.
  • A culture where you’ll belong. We are invested in our teams!
  • In addition to accruing vacation and sick time, we also have a Winter Holiday Shutdown every year.
  • Parental bonding leave (for both mothers and fathers).
  • Pet insurance.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

Bachelor's degree

Number of Employees

101-250 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service