Sr Systems Engineer HPC

Shell
Houston, TX
Posted 1d ago
$124,000 - $186,000

About The Position

What’s the role

If you have a passion for delivering and continuously improving world-class IT operational services, this opportunity will allow you to play a pivotal role in delivering and supporting digital solutions that provide the most value to Shell. The High-Performance Computing (HPC) Team’s core purpose is to deliver differentiated IT services in a secure, reliable, and affordable manner that enables business value. In this role, you will work with colleagues and users across Shell globally. The HPC environment within Shell is constantly changing as we adapt to evolving business needs with innovative solutions and reliable services for our Shell customers. The global nature and large-scale use of HPC within Shell provide a challenging and exciting work environment.

Requirements

  • Must have legal authorization to work in the U.S. on a full-time basis for anyone other than the current employer
  • Bachelor’s Degree is required
  • 7+ years of experience in a Linux-based High-Performance Computing (HPC) environment, including management and support of HPC simulation applications
  • Experience with cloud-native technologies such as AWS, Kubernetes, containerization, Terraform, Python, and AI/ML frameworks
  • Proficiency in programming languages including C, Pro*C, C++, Java, Perl, Python, Ansible, Lua, PHP, HTML, and CSS
  • Experience with Unix/Linux operating systems, including Red Hat
  • Hands-on experience administering HPC middleware (e.g., SLURM, web-based HPC tools), including configuration, patching, and troubleshooting
  • Experience designing and managing heterogeneous compute clusters (CPUs, GPUs, etc.)
  • Familiarity with relational databases such as Oracle and MySQL
  • Strong familiarity with Command Line Interfaces (CLI) and REST APIs
  • Strong analytical, problem-solving, communication, and interpersonal skills
  • Ability to work effectively in distributed, cross-functional, and cross-cultural teams
  • Demonstrated collaboration and knowledge-sharing skills
  • Understanding of ITIL processes, including Request, Incident, and Change Management
  • Ability to navigate conflict and ambiguity professionally and deliver results through influence and teamwork

Responsibilities

  • Own and manage the HPC operating system provisioning environment, ensuring reliable and repeatable deployments at scale
  • Contribute to and influence architecture and design decisions for on‑prem and hybrid HPC platforms
  • Operate and maintain hybrid cloud and on‑premises compute clusters, ensuring high availability and performance
  • Automate operational processes using Bash, Python, and Ansible to improve reliability, consistency, and efficiency
  • Ensure systems remain compliant with security and technical policies, including OS patching, configuration management, and migrations between major OS releases
  • Troubleshoot complex issues, perform root cause analysis, and implement scalable, long‑term solutions
  • Collaborate closely with HPC Project Management and suppliers to support hardware planning, procurement, and deployment activities
  • Define, guide, and implement HPC operational best practices across platforms and services

Benefits

  • For regular full-time or regular part-time employees of the Company (participating companies as listed in the Summary Plan Description), insurance coverage options include medical, dental, and vision coverage, life insurance, Business Travel Accident Insurance, and Occupational Accidental Death Benefit programs.
  • Employees also participate in a company pension plan and a 401(k) plan.
  • Paid leave includes up to 6 weeks of paid vacation time, up to 11 paid holidays, and paid parental leave of 16 weeks for birthing parents and 8 weeks for non-birthing parents.
  • Additionally, employees are eligible for short-term disability leave for up to 26 weeks at 100% or 50% of base pay as well as Long-Term Disability insurance.
  • Shell also offers other compensation such as financial reimbursement for adoption, wellness, education, and personal learning expenses, and some roles are eligible for discretionary long-term incentives.
  • For interns, eligible benefits include medical, dental, and vision coverage, life insurance, Business Travel Accident Insurance, and Occupational Accidental Death Benefit programs; participation in a 401(k) plan; and paid leave for up to 11 paid holidays.
  • Benefit from flexible working hours, and the possibility of remote/mobile working
  • Perform at your best with a competitive starting salary and annual performance-related salary increase – our pay and benefits packages are among the best in the world.
  • Take advantage of paid parental leave, including for non-birthing parents
  • Join an organization working to become one of the most diverse and inclusive in the world.
  • Gain access to a wide range of training and development programs