Operations Manager - HPC

Xcel EngineeringOak Ridge, TN
10d

About The Position

XCEL Engineering is seeking a qualified applicant for a Technical Operations Manager role. The TOM will serve as a key contributor to the success of project research initiatives by managing and advancing technical operations at a project level. This role involves close collaboration with the project's Principal Investigator (PI), oversight of high-performance computing (HPC) and storage infrastructure, and facilitation of user onboarding and offboarding. The ideal candidate will bring technical expertise, sound judgment, and a proactive approach to supporting project research computing environments.

Requirements

  • United States citizen with the ability to obtain a security clearance.
  • Bachelor's degree in Information Technology, IT Operations Management, or a related field.
  • A minimum of eight (8) years of relevant experience, or an equivalent combination of education and experience.
  • Strong technical knowledge of information systems management and systems architecture.
  • Proven ability to gather and interpret system requirements for complex research projects.
  • Ability to direct HPC technical work.
  • Excellent verbal and written communication skills for engaging with staff, sponsors, and stakeholders.
  • Demonstrated interpersonal skills that support collaboration, leadership, and team building.

Nice To Haves

  • IT project management experience.
  • Experience working in a research or technical environment.
  • Motivated self-starter who works independently and participates creatively in collaborative teams across the laboratory.
  • Ability to function well in a fast-paced research environment, set priorities to accomplish multiple tasks within deadlines, and adapt to ever changing needs.

Responsibilities

  • Maintain and advance technical operational duties across research projects, ensuring alignment with evolving scientific needs.
  • Collaborate with the project PI to manage and fulfill data requirements for research teams.
  • Lead the facilitation of technical onboarding and offboarding for users and projects, ensuring seamless transitions.
  • Manage the full hardware lifecycle, including provisioning and decommissioning of storage-as-a-service and HPC clusters.
  • Provide technical recommendations to improve system health, performance, and scalability.
  • Oversee full-cycle resource management, including intake and fulfillment of HPC requests.
  • Analyze incoming project requests using expert judgment and advise the PI on prioritization and feasibility to ensure they meet project needs.
  • Prepare and present reports on system usage, project financials, task status, and other key performance indicators.
  • Direct project technical operations and staff to ensure work priorities are met and shift priorities as required.
  • Attend project meetings and interpret technical requirements to staff and stakeholders.
  • Document and maintain operational procedures, workflows, and recommend improvements to enhance efficiency.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service