Lead Systems Engineer

Institute for Defense AnalysesPrinceton, NJ
Onsite

About The Position

The Institute for Defense Analyses (IDA) has an immediate career opening for a Lead Systems Engineer. This opening is located at IDA's Center for Communications Research in Princeton, New Jersey (CCRP). IDA offers a competitive salary, an excellent benefits package and a superior professional working environment. To the right individual, IDA offers the opportunity to have a major impact on key national programs while working in support of technical issues and projects. IDA is seeking a qualified individual to manage its High Performance Computing (HPC) resources, including compute clusters, parallel file systems and high-speed networks. The successful applicant will be an expert in the Linux operating system and have significant experience with CPU/GPU based systems, high-performance storage technologies (e.g. Lustre), HPC or High Throughput job allocation technologies (e.g. Slurm, HTCondor), parallel computing environments such as MPI and CUDA, and high-performance network technologies (e.g. InfiniBand, GigE). The incumbent will recommend technologies; work with vendors to specify equipment; supervise or participate in installation; maintain, administer and troubleshoot systems; install software to support research; ensure compliance with DoD and sponsoring Agency requirements; and help researchers get the most out of the systems. Moreover, the individual will act as part of a team to maintain the environment in which the HPC systems function and support the mission of IDA/CCR-P and its sponsor.

Requirements

  • Bachelor of Science degree in Computer Science or equivalent experience in related field.
  • Eight years minimum experience in Information Technology, which includes at least six in systems administration.
  • Possess advanced, subject matter expertise in design, administration, and support of servers, systems and software, using Linux/Unix and/or Windows.
  • Experience in more than one of the following areas is a requirement: High Performance Computing (HPC) systems or large cluster computing, including GPU based systems.
  • High performance storage technologies such as Lustre or Hadoop.
  • HPC or High Throughput Computing (HTC) job allocation technologies such as Slurm or HTCondor.
  • Parallel computing libraries and environments such as MPI and CUDA.
  • High performance network technologies such as InfiniBand and GigE
  • Authentication, access control, compliance and security in a DOD environment
  • Open Source software installation and support
  • Must be organized, self-motivated and able to work with moderate supervision.
  • Ability to communicate effectively in both written and verbal form and with all levels of employees; possess good interpersonal skills.
  • Must be willing to work hours outside of a regular schedule, including periodic on-call support.
  • Position requires ability to obtain and maintain Top Secret/SCI security clearance with full scope polygraph. Current TS/SCI with full scope polygraph clearance preferred.
  • Ability to obtain and maintain DOD 8570 IAT II certification.

Nice To Haves

  • Current TS/SCI with full scope polygraph clearance preferred.

Responsibilities

  • Takes the primary role as project leader and designer of new IT technology initiatives.
  • Develops test and integration plans for new systems and software in order to ensure compatibility with current infrastructure.
  • Provides operational support and maintenance, when necessary, to ensure systems functionality, availability, security and performance.
  • Ensures all systems meet or exceed the business and security requirements in accordance with IDA, DOD, NSA, DISA and DSS directives and guidelines.
  • Mentors junior staff and coordinates work assignments of junior administrators to ensure project schedules are maintained.
  • Prepares technical documentation, to include standard operating procedures and processes.
  • Develops resolutions to complex problems that require the frequent use of creativity. Coordinates resources to resolve problems when necessary.
  • Administers and maintains classified and unclassified systems and services to ensure optimum performance and availability
  • Maintains technical proficiency in new IT technologies; keeps abreast of industry trends, and makes recommendations to improve or advance services and system performance.
  • Communicates all computer network, system and service problems and outages immediately to the appropriate supervisors and/or managers.
  • Responds to critical after hours support issues.
  • Performs other duties as assigned.

Benefits

  • diverse health insurance options
  • generous 10% contribution to retirement
  • 6 weeks 100% paid parental leave
  • 20 days paid time off
  • tuition reimbursement
  • internal and external trainings
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service