HPC Hardware Engineer

CGG
Houston, TX
Onsite

About The Position

Viridien (www.viridiengroup.com) is an advanced technology, digital and Earth data company that pushes the boundaries of science for a more prosperous and sustainable future. With our ingenuity, drive and deep curiosity we discover new insights, innovations, and solutions that efficiently and responsibly resolve complex natural resource, digital, energy transition and infrastructure challenges.

Job Description

We are seeking a highly motivated individual to join our team, focusing on the development, implementation, and support of our Houston Data Center infrastructure. The ideal candidate will have demonstrable skills in High Performance Computing (HPC) and a proactive approach to data center operations. Join us in shaping the future of HPC and Cloud technology at Viridien!

Requirements

  • Must be IT literate with good knowledge of Linux distributions.
  • Technical proficiency in HPC system architecture.
  • Expertise in structured cabling for data centers.
  • Expertise in diagnosing, troubleshooting and solving technical issues.
  • Excellent communication, presentation, customer service and team skills.
  • Proven record of successful mentorship of juniors.
  • Delivery-focused, with a problem-solving attitude.
  • Effective verbal and written communication skills.
  • Project management skills are a plus.
  • You will have a BSc or MSc in Computer Science, Computer Engineering, or Computer Information Systems.
  • 5+ years’ experience in data center hardware infrastructure, with at least 2 years focused on GPU systems and immersion cooling technologies.
  • HPC systems architecture, compute and storage

Nice To Haves

  • Scripting languages
  • Computational thinking

Responsibilities

  • Lead hardware maintenance
  • CPU/GPU high-performance computing
  • Oil immersion servers
  • Air-cooled servers
  • Storage Infrastructure services
  • Equipment ordering
  • Building Bill of Materials for new installations
  • Lead spare parts management
  • Lead hardware installation
  • Data center upkeep
  • Rack cabling standards
  • IT Workshop Equipment audits
  • Hardware decommissioning
  • Lease management
  • Documentation management
  • Incident management
  • Problem management
  • Change management
  • Continual service improvement
  • Lead and participate in the deployment of new systems and applications
  • Participate in meetings and conference calls related to project and troubleshooting activities
  • This role requires on-call availability