HPC Software Engineer

OfinnoReston, VA
54d

About The Position

The Advanced Media Lab (AML) at Ofinno is pioneering research in next-generation video coding standards, such as JVET and MPEG. To accelerate innovation, AML relies on large-scale computational experiments powered by high-performance computing (HPS) infrastructure. We are seeking an HPC Software Engineer to design, manage, and maintain the lab's computing platform that enables our researchers to develop, test, and optimize advanced video coding technologies. This individual will play a critical role in ensuring the stability, scalability, and efficiency of AML's HPC environment while collaborating closely with research engineers to streamline experimental workflows and support activities tied to the JVET/MPEG meeting cycle. This is a mid-level position ideal for an engineer who thrives at the intersection of system administration, high-performance computing, and applied research support. Based on your experience and qualifications, you may join us as an Engineer or Senior Engineer.

Requirements

  • Bachelor's or Master's degree in Computer Science, Electrical/Computer Engineering, or a related field
  • 3+ years of experience in cluster or system administration within an HPC or research computing environment
  • Proven experience in Linux system setup, administration, and management in a network environment
  • Hands-on experience with job schedulers, specifically Slurm or alternatives such as HTCondor or Sun/Oracle Grid Engine, as a user or administrator
  • Experience working with or supporting video codec standardization teams, such as JVET
  • Solid understanding of basic server room setup and network infrastructure management, including power and connection planning
  • Proficiency in scripting for automation and workflow management
  • Strong problem-solving, debugging, and communication skills

Nice To Haves

  • Familiarity with system management or deployment and monitoring tools in HPC environments preferred
  • Experience with continuous integration and testing frameworks or version control, e.g., Git, preferred
  • Experience managing or maintaining GPU clusters, i.e. CUDA environments and containerized workloads, preferred

Responsibilities

  • Set up, administer and maintain Linux-based high-performance computing clusters, ensuring reliability, scalability, and high availability for research workloads.
  • Administer and manage current job scheduling system (Slurm), and seek evolutions in the future, e.g., better configuration, newer versions, or alternative workload managers to better meet users' needs.
  • Develop and maintain automation scripts to streamline experiment execution, data handling, and routine regression testing for video codec research.
  • Collaborate with AML research engineers to understand their computational needs and support JVET/MPEG-related experimental workflows.
  • Support the research team in conducting experiments aligned with JVET/MPEG meeting cycles, ensuring timely execution and data availability.
  • Monitor and troubleshoot cluster performance, resource allocation, and system-level issues.
  • Ensure system security and performance optimization, including updates, patches, and user access management.
  • Serve as an on-call resource for maintenance and urgent support requests.
  • Document system setup, configurations and procedures, contributing to long-term sustainability and knowledge sharing.

Benefits

  • 401(K) matching -- We help you plan and save for retirement with a 401(K) matching program that’s available on day one.
  • Free healthcare plans -- Ofinno covers full premiums for you are your family on select healthcare plans, including employer HSA contributions if applicable.
  • Free Food -- Our kitchen is always fully stocked, including lunch, protein bars, fruit, sodas, coffee and tea.
  • Unlimited Paid Time Off -- Our lives are enriched by family time, vacations, and personal time, so we offer unlimited paid time off and sick leave.
  • On-campus gym -- Unwind, reduce stress and feel great – even when you’re at work.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service