INFRASTRUCTURE & HPC SYSTEMS ENGINEER

Federal Reserve SystemPhiladelphia, PA
Onsite

About The Position

You will ensure integrity, reliability, and availability of agile research computing environments by managing Windows/Linux server infrastructure, high-performance computing (HPC) clusters, and cloud/colocation/on-premises services. You will provide advanced specialized technical support to end users while developing automation tools and optimizing computational workflows to meet evolving rigorous research needs. You foster trust, open communication, shared goals and collaboration with stakeholders across the Federal Reserve System and externally.

Requirements

  • Bachelor’s degree in computer science, engineering, mathematics, or related field, or equivalent combination of education and experience.
  • Minimum of 5 years of relevant experience in HPC administration and systems engineering.
  • Extensive experience with Linux operating systems (Red Hat/CentOS) in an HPC environment.
  • Command line skills and proficiency in scripting languages (Python, Bash).
  • Experience with job scheduling systems (SLURM) and resource management.
  • Knowledge of parallel file systems and storage technologies (e.g. ceph, GPFS, Lustre, BeeGFS).
  • Familiarity with parallel programming models (MPI, OpenMP) and scientific computing frameworks.
  • Experience with configuration management and automation tools (Terraform).
  • Demonstrated specialized problem-solving abilities and analytical thinking.
  • Solid appreciation for research, sound judgment and healthy professional skepticism, understands sensitivities, considers big picture in addition to tactical details.
  • Ability to communicate effectively with PhD economists as well as with various levels of personnel and different types of specialists, strong interpersonal and listening skills, approachable.
  • Agile and comfortable working in evolving rigorous research environments.
  • Research support-oriented, responsive to time-sensitive matters and custom needs.
  • Must be a U.S. citizen, U.S. national, or U.S. permanent resident who is not yet eligible to apply for naturalization or who has applied for naturalization within the requisite timeframe. Permanent residents must sign a declaration of intent to become a U.S. citizen when eligible. Candidates who are not U.S. citizens or U.S. permanent residents may be eligible if they sign a declaration of intent to become a permanent resident and a U.S. citizen and meet other eligibility requirements.
  • Must undergo an applicable background check and comply with all applicable information handling rules.
  • Must provide work authorization to prove eligibility to work in the United States.

Nice To Haves

  • Experience with cloud environments
  • Experience with colocation services
  • Experience with GPU computing and accelerator technologies
  • Experience with container technologies (Docker)
  • Experience with configuration management and automation tools (Terraform)

Responsibilities

  • Respond to problems and maintain Windows and Linux server environments in research settings
  • Design, deploy, configure, and administer HPC clusters and associated systems
  • Monitor system health, performance metrics, and resource utilization to ensure optimal, efficient operation
  • Implement robust security protocols and perform regular maintenance including upgrades and patching
  • Manage job scheduling and workload optimization using tools like Slurm
  • Support and troubleshoot user endpoints, servers, and services in various environments (i.e. cloud, on-premises, collocation)
  • Participate in planning, budgeting, and monitoring of various environments
  • Develop tools and scripts to automate management and creation of systems and services in various environments
  • Create and maintain automation scripts to streamline system administration tasks
  • Optimize scientific applications and computational workflows for performance
  • Implement container technologies (Docker) for reproducible research
  • Support GPU computing and accelerator technologies for specialized workloads
  • Design and implement innovative HPC solutions to address evolving research requirements
  • Define and track performance metrics to ensure efficient current and future use of resources
  • Respond to research end user requests to diagnose problems and provide specialized technical support
  • Troubleshoot highly complex hardware and software issues in multi-user research environments
  • Resolve problems quickly and accurately with thorough follow-up to ensure complete resolution
  • Assist staff with IT-related problem resolution and use of facilities
  • Partner closely with researchers to understand computational needs and translate them into technical solutions
  • Collaborate with network, security, and data teams to ensure integrated operations
  • Build and maintain relationships with vendors and technology partners
  • Collaborate as technical advisor on infrastructure planning and technology roadmaps
  • Participate in product and technology evaluations, testing, and pilot activities to provide sound recommendations
  • Engage in Federal Reserve System, academic, and other HPC communities to stay current with emerging technologies and effective practices
  • Develop comprehensive documentation for systems, policies, and procedures
  • Create user guides and training materials for researchers utilizing HPC resources
  • Conduct workshops and training sessions on effective use of HPC resources and research computing tools

Benefits

  • Medical (4 options)
  • Prescription Insurance
  • Dental (3 options) Insurance
  • Vision Insurance
  • 401k/Thrift Plan with generous employer match
  • Employer-funded Pension Plan
  • Paid Vacation/Sick Time & Holidays
  • Monthly $200 Commuter Allowance
  • Flexible Spending Accounts
  • Healthcare Spending Accounts
  • Flexible Work Schedule available in most departments
  • Life Insurance
  • Long Term Disability Insurance
  • Tuition Reimbursement (undergraduate and graduate)
  • Parental Leave
  • Free onsite 24/7 Fitness Center including training classes, Peloton bikes and locker room / shower facilities
  • Onsite Cafeteria & Coffee Shop
  • Additional Convenience Benefits, Discounts and More
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service