Senior Systems Engineer

IREN, Dallas, TX

About The Position

The HPC Senior Systems Engineer will spearhead the design, deployment, and optimization of our high-performance computing (HPC) systems with a focus on HPC clustering, Kubernetes, Slurm, and management tools. The ideal candidate will seamlessly blend traditional HPC solutions with modern container orchestration, ensuring a versatile and scalable computing environment that can be offered to our customers.

Requirements

  • Minimum of 5 years of experience in HPC system architecture with proven expertise in designing, deploying, and managing HPC clusters.
  • Extensive knowledge of Kubernetes, with a focus on its integration within HPC environments.
  • Hands-on experience with the Slurm workload manager and its intricacies.
  • Familiarity with HPC management tools and software, ensuring efficient system monitoring and troubleshooting.
  • Proven track record of resolving complex system challenges and enhancing operational performance.
  • A Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Understanding of cloud platforms and their integration into HPC ecosystems.
  • Deep knowledge of network and storage solutions commonly used in HPC setups.

Nice To Haves

  • Relevant certifications in Kubernetes, HPC technologies, or system architecture.

Responsibilities

  • Lead the deployment and maintenance of HPC clusters, ensuring they operate effectively and with maximum availability.
  • Integrate and manage HPC software components such as Kubernetes, Slurm, cluster management software, and any infrastructure required to operate the HPC environment.
  • Stay abreast of advancements in HPC, Kubernetes, and associated technologies, bringing innovations into our operations and product options.
  • Collaborate with technical teams to establish and implement best practices for system maintenance and optimization.
  • Draft comprehensive documentation, including system designs, operational procedures, and best practice guidelines.
  • Facilitate the selection and integration of relevant management tools to monitor, troubleshoot, and enhance HPC operations.
  • Provide technical leadership and training to other team members, fostering an environment of continuous learning and improvement.

Benefits

  • Highly competitive compensation package
  • Base salary
  • Annual performance incentives
  • Opportunities to build long-term wealth through equity programs
  • Total rewards package
  • 100% company-paid health insurance premiums (medical, dental, and vision) for employees
  • 75% company-paid coverage for dependents
  • Company-paid short-term and long-term disability insurance
  • Voluntary life, critical illness, and accident coverage available
  • Health Savings Accounts (HSA)
  • Employee Assistance Program and wellness resources
  • 401(k) retirement plan with company match
  • Access to financial planning and legal services
  • Paid Time Off (PTO)
  • Paid holidays
  • Internal skills training and advancement pathways
  • Professional development to support certifications, continuing education, or role-related training
  • Company events and team-building activities