About The Position

The AI, Learning and Intelligent Systems (ALIS) Group within the National Lab of the Rockies’ (NLR) Computational Science Center has an exciting opportunity for a graduate student to actively engage in a project developing an ecosystem of analysis and simulation tools for understanding power consumption and infrastructure requirements of AI data centers. We are looking for a 3-month spring/summer student (remote, in-person, or hybrid). As a valued member of the team, the graduate student will collaborate closely with peers to improve NLR applications. The successful candidate will use software-based power measuring tools to sample the consumption of AI LLM workloads (fine-tuning, offline and online inference) on NLR’s supercomputing (HPC). We particularly welcome candidates with creative problem-solving skills, interest in cross-disciplinary collaboration, and a passion for NLR's mission.

Requirements

  • Minimum of a 3.0 cumulative grade point average.
  • Undergraduate: Must be enrolled as a full-time student in a bachelor’s degree program from an accredited institution.
  • Post Undergraduate: Earned a bachelor’s degree within the past 12 months. Eligible for an internship period of up to one year.
  • Graduate: Must be enrolled as a full-time student in a master’s degree program from an accredited institution.
  • Post Graduate: Earned a master’s degree within the past 12 months. Eligible for an internship period of up to one year.
  • Graduate + PhD: Completed master’s degree and enrolled as PhD student from an accredited institution.
  • Currently enrolled in or recently graduated from a master’s degree, or currently enrolled in a PhD program in an Engineering, Computer Science, Applied Math, or related field
  • Demonstrated experience with software development in Python
  • Demonstrated experience with high-performance computing
  • Demonstrated experience with coding repositories i.e. Git
  • Demonstrated experience with conducting sensitivity analysis

Nice To Haves

  • Demonstrated experience with MLPerf and VLLM
  • Demonstrated experience in reading and writing scientific documents
  • Demonstrated experience with HPC i.e. Bash/Shell/Command line and SLURM
  • Demonstrated experience with AI LLM workloads, generative AI etc.

Responsibilities

  • Contribute to the development of software to execute power characterization experiments of AI LLM workloads under different parameter assumptions i.e. sensitivity analysis
  • Performing computational experiments, running batch experiments on NLR HPC using SLURM
  • Analyzing results capturing trends and tradeoffs between parameter configurations
  • Coordinating data exchange to inform other team members’ efforts
  • Contribute to writing and presenting reports and conference proceedings related to characterization tests

Benefits

  • Benefits include medical, dental, and vision insurance
  • 403(b) Employee Savings Plan with employer match
  • sick leave (where required by law)
  • NLR employees may be eligible for, but are not guaranteed, performance-, merit-, and achievement- based awards that include a monetary component.
  • Some positions may be eligible for relocation expense reimbursement.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service