HPC Software Engineer 3 - (HPC, Python, Scala)

AkinaFort Meade, MD
Onsite

About The Position

Designs, develops, tests, deploys, documents, maintains, and enhances complex and diverse software for HPC (high performance computing) systems based upon documented requirements. Requires either a very strong math background and/or a very strong computer hardware background to understand HPC architecture and the mathematical principles underlying the software applications. HPC systems may include processing-intensive analytics, novel algorithm development, manipulation of extremely large data sets, real-time systems, and systems incorporating data repositories, data transport services, and application and systems development and monitoring. Works individually or as part of a team; reviews and tests software components for adherence to design requirements, documents test results, and resolves software problem reports. Utilizes software development and software design methodologies appropriate to the development environment and provides input to system design including hardware/software trade-offs, software reuse, OSS/COTS/GOTS use, and requirements analysis and synthesis from system level to individual software components. Supports efforts to understand performance limitations of FOSS, COTS, and GOTS software, frameworks, and tools deployed on high performance computers, including metrics collection, testing, and informing software or hardware architecture changes. Design, document and execute tests of FOSS, COTS and GOTS software architectures to determine what aspects of the software and/or computer infrastructure are limiting performance Research and identify metrics necessary to understand performance limitations of the software and/or computer infrastructure to support testing Research and identify monitoring necessary to support timely alerting of infrastructure and software failures encountered during testing Identify hardware and software failure trends and develop mitigations encountered during testing Perform root cause analysis Work with the customer metrics and monitoring team to introduce new metrics capabilities to support testing Modify the software architecture and/or develop new software capabilities to overcome performance limitations encountered during testing Review and test software components for adherence to design requirements and document test results Resolve software problem reports Provide input to software components of system design, including hardware/software trade-offs, software reuse, use of OSS, COTS, and GOTS software, and requirements analysis and synthesis from system level to individual software components

Requirements

  • Strong math background and/or strong computer hardware background
  • Working knowledge of Configuration Management (CM) tools
  • Working knowledge of Web Services implementation
  • Software development and software design methodologies appropriate to the development environment
  • Ability to develop software for HPC systems
  • Bachelor’s degree in Math, Computer Engineering, Computer Science, or related discipline from an accredited college or university, plus seven (7) years of experience as an HSE in programs and contracts of similar scope, type, and complexity.
  • Master’s degree in Math, Computer Engineering, Computer Science, or related discipline from an accredited college or university, plus five (5) years of experience as an HSE in programs and contracts of similar scope, type, and complexity.
  • Nine (9) years of experience as an HSE in programs and contracts of similar scope, type, and complexity.

Nice To Haves

  • Experience with distributed computing FOSS software frameworks such as Apache Spark, Dask, and Ray
  • Experience developing software in Python, Scala, Java, and C

Responsibilities

  • Designs, develops, tests, deploys, documents, maintains, and enhances complex and diverse software for HPC systems.
  • Reviews and tests software components for adherence to design requirements.
  • Documents test results.
  • Resolves software problem reports.
  • Utilizes software development and software design methodologies.
  • Provides input to system design including hardware/software trade-offs, software reuse, OSS/COTS/GOTS use, and requirements analysis and synthesis.
  • Supports efforts to understand performance limitations of FOSS, COTS, and GOTS software, frameworks, and tools deployed on high performance computers.
  • Designs, documents, and executes tests of FOSS, COTS and GOTS software architectures.
  • Researches and identifies metrics necessary to understand performance limitations.
  • Researches and identifies monitoring necessary to support timely alerting of infrastructure and software failures.
  • Identifies hardware and software failure trends and develops mitigations.
  • Performs root cause analysis.
  • Works with the customer metrics and monitoring team to introduce new metrics capabilities.
  • Modifies the software architecture and/or develops new software capabilities to overcome performance limitations.

Benefits

  • 24 days PTO accrued annually
  • 11 federal holidays
  • 401k is 100% vested on your start date
  • Company makes a direct contribution worth 10% of your salary to 401k
  • Company covers 100% of healthcare costs for employees
  • Company covers 50% toward dependents' healthcare costs
  • Educational assistance towards college classes
  • Company will cover costs associated with job related training and certifications
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service