RCH Solutions-posted 4 days ago
Full-time • Mid Level
Remote • Wayne, PA
101-250 employees

RCH Solutions is seeking an HPC Engineer to work closely with customer stakeholders, scientists, and IT professionals to deliver Compute at Scale and support our customers' scientific initiatives. The objectives for this role center on developing, evolving, and administering HPC platforms, along with supporting Scientific applications, workflows, and other related infrastructure, both on-prem and Cloud hosted. Our ideal candidate also has hands-on experience with Linux system administration as well as solution architecting and engineering (on-prem and cloud based) and will be instrumental in transforming how IT computing services are leveraged to support our clients' growth. This role will involve driving architecture, roadmaps, and execution of projects to establish and operate IT infrastructure best practices for customers.

  • Collaborating with cross-discipline team members and customers to deliver HPC and peripheral Compute at Scale services.
  • Thorough understanding of related industry best practices.
  • Supporting internal and customer Architecture and Design efforts.
  • Supporting customers with their workflow pipelines (advisory and hands-on).
  • Comprehensively documenting new and existing computational assets.
  • Maintaining the flexibility to pivot as engagement scopes may evolve.
  • Support for AWS & GCP Cloud applications, migrations, and modernization.
  • CloudOps / IaC for ongoing platform management.
  • Setup and configuration of AWS & GCP Cloud infrastructure for new platform builds.
  • Ensuring system compliance with company security standards and applicable regulatory requirements.
  • Transition support for modernized services to operational teams.
  • Provide engineering-level troubleshooting and service restoration for operational issues as they arise on supported platforms.
  • Provide training/mentorship for junior level team members.
  • Escalation point on multiple engagements to ensure resolution.
  • A bachelor’s degree or master’s degree in Computer Science or related field.
  • 5+ years of experience administering HPC clusters and systems.
  • Experience with SLURM and Grid Engine scheduling software preferred.
  • 5+ years of professional experience in Solution Architecture or Cloud Infrastructure Deployment and support.
  • 5+ years of professional experience developing or administering compute solutions for Scientific / Research IT domains, Life Sciences being preferred.
  • Experience with POSIT products (Package Manager, Connect, Workbench) either in an end-user or administrator capacity.
  • Experience developing scientific workflows on HPC systems using Nextflow
  • Extensive command-line system administration experience: user and group management; advanced knowledge of Active Directory, DNS, DHCP, LDAP, NFS, and SMB; building applications from source code; installing, maintaining, and troubleshooting application-level Linux and scientific software in line with industry best practices; installing and fine-tuning the Linux operating system; familiarity with leveraging and maintaining Linux package management systems; intermediate OS-level networking knowledge.
  • Experience with scripting, automation, and configuration management tools; Ansible, Terraform, and CloudFormation experience preferred.
  • Experience administering and integrating Scientific / Research applications.
  • Strong time-management skills; able to complete projects in a timely manner, plan and prioritize tasks while keeping leadership and stakeholders updated regularly on status.
  • Excellent communication skills, including preparation of written documentation for IT colleagues and end users.
  • Proactive thinking skills to identify potential issues and solution options prior to incidents occurring.
  • Extreme attention to detail, needed to interface with multiple clients simultaneously.
  • Ability to understand and analyze complex technical problems and situations.
  • Candidates must be passionate engineers with a strong vision and a desire to stay on top of trends in the Scientific Computing sector.
  • Ability to work independently or with a team
  • Ability to take a project from start to finish with minimal supervision
  • RCH provides services and solutions for the unique challenges of Life Sciences advanced computing, and leverages teams with cross-functional IT skills to meet these challenges. The ideal candidates for this role will have experience working with cross-functional IT (Public Cloud skills being a plus) and sciences skillsets.
  • Experience with Python, R, or other related data science programming languages.
  • Experience working with and/or supporting databases.
  • Experience managing large amounts of data effectively.
  • Experience working with AI/ML technologies.
  • Experience with containerizing compute workloads via Docker or Singularity.
  • Experience with Nvidia DGX systems.
  • A competitive salary and bonus package based on experience
  • Comprehensive health and wellness benefits, including Medical, Dental, and Vision Insurance
  • Company-provided Life and Long-Term Disability Insurance
  • Company-sponsored 401(k) Plan
  • Company-provided continuing education benefit
  • Team-focused culture and unlimited opportunity for advancement