High Performance Computing System Administrator

Caterpillar Inc.Mossville, IL
16hOnsite

About The Position

Your Work Shapes the World at Caterpillar Inc. When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it. Job Summary: Caterpillar Virtual Product Development (VPD) Systems & Platforms team lives at the intersection of engineering and information technology. One of the team’s major responsibilities is enterprise ownership of High-Performance Computing (HPC) capability for engineering modeling and simulation. The HPC operations team has an opening for an engineer who will be responsible for HPC System administration of On-Premise and Cloud-based Linux computing technical infrastructure. This role is part of a global distributed team that shares responsibility for achieving excellence in its operational metrics for performance, availability, and user support.

Requirements

  • Problem Solving: Knowledge of approaches, tools, techniques for recognizing, anticipating, and resolving organizational, operational or process problems; ability to apply knowledge of problem solving appropriately to diverse situations.
  • Application Design, Architecture: Knowledge of basic activities and deliverables of application design; ability to utilize application design methodologies, tools and techniques to convert business requirements and logical models into a technical application design.
  • System and Technology Integration: Knowledge of the features and facilities of systems; ability to integrate and communicate among applications, databases and technology platforms.
  • System Testing: Knowledge of system and software testing; ability to design, plan and execute system testing strategies and tactics to ensure the quality of software at all stages of the system life cycle.

Nice To Haves

  • Typically 2+ years’ experience in administration of heterogeneous IT compute and storage infrastructure
  • Extensive knowledge of Linux operating systems
  • Strong Scripting capability in one or more languages – Python, powershell, shell/bash,etc, Azure/Gitlab Dev-ops CICD pipelines
  • Knowledge of TCP/IP fundamentals
  • Demonstrated experience and relevant certifications with cloud-based computing resource deployment (Azure, AWS).
  • Working knowledge of distributed/parallel file systems and storage appliances (Isilon, Netapp, Qumulo, etc)
  • Experience with HPC deployment and middleware technologies (Bright Cluster manager, Altair PBS Pro, SLURM, Torque MOAB)

Responsibilities

  • Configuration, deployment, and maintenance of the Linux Cluster Hardware and HPC Software applications suite, associated Storage, and network infrastructure.
  • Administration of the teams Hosting and management systems that enables the HPC.
  • Provide technical support and troubleshooting for end users’ issues related to HPC hardware and Solver software applications, evaluate, and perform job performance and application testing.
  • Work on HPC Operational and Strategic Projects efforts, participate in User Group Forums
  • Ensure compliance to enterprise IT security and technology controls
  • Evaluation and implementation of new tools and methods for improved operations and service delivery

Benefits

  • Medical, dental, and vision benefits
  • Paid time off plan (Vacation, Holidays, Volunteer, etc.)
  • 401(k) savings plans
  • Health Savings Account (HSA)
  • Flexible Spending Accounts (FSAs)
  • Health Lifestyle Programs
  • Employee Assistance Program
  • Voluntary Benefits and Employee Discounts
  • Career Development
  • Incentive bonus
  • Disability benefits
  • Life Insurance
  • Parental leave
  • Adoption benefits
  • Tuition Reimbursement
  • These benefits also apply to part-time employees

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service