About The Position

The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD/PhD-level support for researchers. The group is composed of a high-performance computing team, a clinical data warehouse team and a data services team. The Lead HPC Architect , Cybersecurity, High Performance Computational and Data Ecosystem, is responsible for designing, implementing, and managing the cybersecurity infrastructure and technical operations of Scientific Computing’s computational and data science ecosystem. This ecosystem includes a 25,000+ core and 40+ petabyte usable high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. The HPC system is the fastest in the world at any academic biomedical center (Top 500 list). To meet Sinai’s scientific and clinical goals, the Lead brings a strategic, tactical and customer-focused vision to evolve the ecosystem to be continually more resilient, secure, scalable and productive for basic and translational biomedical research. The Lead combines deep technical expertise in cybersecurity, HPC systems, storage, networking, and software infrastructure with a strong focus on service, collaboration, and strategic planning for researchers and clinicians throughout the organization and beyond. The Lead is an expert troubleshooter, productive partner and leader of projects. The lead will work with stakeholders to make sure the HPC infrastructure is in compliance with governmental funding agency requirements and to promote efficient resource utilizations for researchers This position reports to the Director for HPC and Data Ecosystem in Scientific Computing and Data.

Requirements

  • Bachelor?s degree in computer science, engineering or another scientific field. Master's or PhD preferred.
  • 10 years of progressive HPC system administration experience with Enterprise Linux releases including RedHat/CentOS/Rocky Systems, and batch cluster environment.
  • Experience with all aspects of high-throughput HPC including schedulers (LSF or Slurm), networking (Infiniband/Gigabit Ethernet), parallel file systems and storage, configuration management systems (xCAT, Puppet and/or Ansible), etc.
  • Proficient in cybersecurity processes, posture, regulations, approaches, protocols, firewalls, data protection in a regulated environment (e.g. finance, healthcare).
  • In-depth knowledge HIPAA, NIST, FISMA, GDPR and related compliance standards , with prove experience building and maintaining compliant HPC system
  • Experience with secure enclaves and confidential computing.
  • Proven ability to provide mentorship and technical leadership to team members.
  • Proven ability to lead complex projects to completion in collaborative, interdisciplinary settings with minimum guidance.
  • Excellent analytical ability and troubleshooting skills.
  • Excellent communication, documentation, collaboration and interpersonal skills.
  • Must be a team player and customer focused.
  • Scripting and programming experience.

Nice To Haves

  • Proficient with cloud services, orchestration tools, openshift/Kubernetes cost optimization and hybrid HPC architectures.
  • Experience with Azure, AWS or Google cloud services.
  • Experience with LSF job scheduler and GPFS Spectrum Scale.
  • Experience in a healthcare environment.
  • Experience in a research environment is highly preferred.
  • Experience with software that enables privacy-preserving linking of PHI.
  • Experience with Globus data transfer.
  • Experience with Web service, SAP HANA, Oracle, SQL, MariaDB and other database technologies.

Responsibilities

  • HPC Cybersecurity & System Administration: Design, implement, and manage all cybersecurity operations within the HPC environment, ensuring alignment with industry standards (NIST, ISO, GDPR, HIPAA, CMMC, NYC Cyber Command, etc.).
  • Implement best practices for data security, including but not limited to encryption (at rest, in transit, and in use), audit logging, access control, authentication control, configuration managements, secure enclaves, and confidential computing.
  • Perform full-spectrum HPC system administration: installation, monitoring, maintenance, usage reporting, troubleshooting, backup and performance tuning across HPC applications, web service, database, job scheduler, networking, storage, computes, and hardware to optimize workload efficiency.
  • Lead resolution of complex cybersecurity and system issues; provide mentorship and technical guidance to team members.
  • Ensure that all designs and implementations meet cybersecurity, performance, scalability, and reliability goals.
  • Ensure that the design and operation of the HPC ecosystem is productive for research.
  • Lead the integration of HPC resources with laboratory equipment for data ingestion aligned with all regulatory such as genomic sequencers, microscopy, clinical system etc.
  • Develop, review and maintain security policies, risk assessments, and compliance documentation accurately and efficiently.
  • Collaborate with institutional IT, compliance, and research teams to ensure all regulatory, Sinai Policy and operational alignment.
  • Design and implement hybrid and cloud-integrated HPC solutions using on-premise and public cloud resources.
  • Partner with other peers regionally, nationally and internationally to discover, propose and deploy a world-class research infrastructure for Mount Sinai.
  • Stay current with emerging HPC, cloud, and cybersecurity technologies to keep the organization’s infrastructure up-to-date.
  • Work collaboratively, effectively and productively with other team members within the group and across Mount Sinai.
  • Provide after-hours support as needed.
  • Perform other duties as assigned or requested.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service