IT Sr. Manager, Research Computing (IT@JH)

Johns Hopkins UniversityBaltimore, MD
33dHybrid

About The Position

IT@JH Research Computing is seeking an IT Sr. Manager which is a dual-role position that blends high-level technical expertise in high-performance and AI computing with managerial oversight for the Research Computing engineering team. The position provides hands-on leadership in designing, deploying, and optimizing HPC/AI infrastructure, CPU/GPU clusters, high-bandwidth networks, parallel and object storage, and large-scale scientific software environments, while also managing day-to-day team operations, project execution, and stakeholder coordination. This role supports faculty, researchers, and campus partners by developing reliable and scalable computing systems, guiding technical staff, and ensuring a cohesive, well-documented, and service-oriented operational model. Assignments are both project-based and operational, requiring independent decision-making, technical leadership, and strong communication across the research computing ecosystem. This role also collaborates closely with the IT Architect and supports strategic planning with the Director of Research Computing.

Requirements

  • Bachelor's Degree.
  • Seven years of progressively responsible IT management experience.
  • Additional education may substitute for required experience and additional related experience may substitute for required education beyond a high school diploma/graduation equivalent, to the extent permitted by the JHU equivalency formula
  • Cloud Serverless Computing Architecture - Advanced
  • Communication - Advanced
  • Information Technology Infrastructure Library - Advanced
  • IT Documentation - Advanced
  • IT Services Management - Advanced
  • IT Strategic Planning - Advanced
  • Project Management - Advanced
  • Software Development Life Cycle - Advanced
  • User Experience - Advanced

Nice To Haves

  • At least 5 years of direct experience in high-performance computing environments, including administration of large, multi-user research clusters with tens of thousands of CPU cores, hundreds of GPUs, and multi-petabyte distributed storage systems.
  • Seven or more years of progressive IT systems experience, including Linux systems administration, IT operations, infrastructure support, or technical leadership roles.
  • Strong proficiency in Linux engineering and automation, with hands-on expertise writing maintainable operational tooling in Python, Bash, and SQL for provisioning, monitoring, lifecycle management, and workflow optimization.
  • Demonstrated experience configuring and optimizing HPC resource management and cluster orchestration platforms such as Slurm, Bright Cluster Manager, and xCAT, including scheduler policy tuning and usage analytics.
  • Proven ability to administer and troubleshoot large-scale parallel and distributed storage systems such as GPFS, VAST, WEKA, and ZFS, with experience in quota management, failure recovery, capacity planning, and performance monitoring.
  • Deep experience building and maintaining observability and monitoring stacks using Prometheus, Grafana, InfluxDB, Telegraf, and custom exporter tooling, including development of dashboards for GPU utilization, node health, and job efficiency.

Responsibilities

  • As a member of a senior management team, contributes or leads planning to achieve organizational goals by prioritizing initiatives and coordinating the evaluation, deployment, and management of current and future technologies.
  • Develops technology solutions to anticipate the organization's needs, be cost-effective, reliable and compatible with existing and emerging technologies.
  • Anticipates change and responds when technology requirements emerge and evolve.
  • Based on understanding of organizational goals, mission and culture, assesses impact and effectiveness of technology to ensure it supports the organization's needs.
  • Helps establish budgetary goals and provides input towards priorities. May develop IT operations budget.
  • Works with constituents in conjunction with other IT leaders to interpret customer business needs and makes recommendations for strategic investments in technology, applications, business process, personnel, etc. that meets the agreed upon goals of the organization.
  • Ensures that applicable Hopkins policies, practices, regulatory requirements are addressed and followed within his/her area of responsibility.
  • Represents senior organizational leadership, often with delegated authority, in meetings both within and outside of Hopkins.
  • Represents the interest of the organization at industry, state and federal meetings to ensure that the best interests of the organization are considered.
  • Manages the customer relationship and satisfaction as well as adherence to the contractual obligations.
  • Creates and promotes a culture of excellent customer service.
  • Establishes and maintains ties with colleagues throughout the institution to ensure optimal collaboration and coordination of effort.
  • Maintains relationships with strategic technology vendors for the organization.
  • Has direct responsibility for the design, development, and application of technical solutions that satisfy customer needs and are essential to the ongoing operations of the department or IT function.
  • Is responsible for the management of multiple IT projects that impact the department or IT function, including planning, and monitoring progress toward completion.
  • Ensures continuous delivery of information technology support and services through direct management of service level agreements.
  • Recruits, develops, retains, and organizes staff.
  • Assigns tasks, monitors progress, and provides guidance.
  • Perform other related duties as requested.
  • Other duties as assigned.
  • Supervises Technical Staff within Research Computing
  • Senior Systems Engineers
  • Systems Administrators
  • HPC Software Engineers

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Manager

Industry

Educational Services

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service