We are looking for an experienced HPC Software Engineer to join their rapidly growing team! Requires an active TS/SCI Poly security clearance Description: The HPC Software Engineer designs, develops, tests, deploys, documents, maintains, and enhances complex and diverse software for HPC (high performance computing) systems based upon documented requirements. Critical for an HSE is either a very strong math background and/or a very strong computer hardware background. One or both of these are required for the HSE to be able to understand the intricacies of the HPC architecture that is being targeted by the software being developed, or to understand the complex mathematical principles which underlie the software applications being developed. The HPC systems might include, but are not limited to, processing-intensive analytics, novel algorithm development, manipulation of extremely large data sets, real-time systems, and systems which incorporate data repositories, data transport services, and application and systems development and monitoring. Works individually or as part of a team. Reviews and tests software components for adherence to the design requirements and documents test results. Resolves software problem reports. Utilizes software development and software design methodologies appropriate to the development environment. Provides specific input to the software components of system design to include hardware/software trade-offs, software reuse, use of Open-Source Software (OSS) and/or Commercial Off-The-Shelf (COTS)/Government Off-The- Shelf (GOTS) software in place of new development, and requirements analysis and synthesis from system level to individual software components. Working knowledge of Configuration Management (CM) tools and Web Services implementation. The project represents a growing group of efforts targeted at understanding the performance limitations encountered by FOSS, COTS and GOTS software, frameworks and tools deployed on High Performance Computers. Team members design, build, deploy, test and interpret output from metrics collection capabilities to identify performance limiters and develop courses of action to overcome them. Test results are used to inform software modification, deployment of alternate software architectures and hardware architecture enhancement. Example tasks include: Design, document and execute tests of FOSS, COTS and GOTS software architectures to determine what aspects of the software and/or computer infrastructure are limiting performance. Research and identify metrics necessary to understand performance limitations of the software and/or computer infrastructure to support testing. Research and identify monitoring necessary to support timely alerting of infrastructure and software failures encountered during testing. Identify hardware and software failure trends and develop mitigations encountered during testing. Perform root cause analysis. Work with the customer metrics and monitoring team to introduce new metrics capabilities to support testing. Modify the software architecture and/or develop new software capabilities to overcome performance limitations encountered during testing. Experience with the distributed computing FOSS software frameworks below is beneficial. Apache Spark, Dask, Ray Experience developing software in the languages below is beneficial. Python, Scala, Java, C
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed