High Performance Computing Engineer, Sr

Lockheed Martin, Fort Worth, CA
Remote

About The Position

In light of current internal business pressures, priority consideration will be given to qualified candidates who are currently affiliated with the Enterprise Operations’ Enterprise Business & Digital Transformation (EBDT) organization.

Become part of the Future of IT at Lockheed Martin as a Full Stack Engineer within the FORCE Portfolio! This dynamic, fast-paced environment is embracing DevSecOps and Agile to enable our strategic goals. The Engineer role will be instrumental in reinventing how we develop and maintain compute infrastructure products at Lockheed Martin to meet the needs of every business area.

The FORCE Portfolio resides within the Enterprise IT Infrastructure and International (I2) Organization. It includes (but is not limited to) development and operations for the following Product Teams: Compute IaaS (Virtualization, Server OS, OpenStack), PaaS (Containers, Database Engines, Middleware, Splunk), Storage, Data Center/Hardware, High Performance Computing (Simulation, AI/ML), Governance, Commercial Cloud Native Offerings, and Service Management (Customer Portal, Job Scheduling). These solutions are built to meet global needs and include data center locations on premises and in the public cloud.

This Full Stack Engineer role is aligned to a single Delivery Team within the HPC Product Team: the High Performance Computing (HPC) Delivery Team, with a focus on AI Infrastructure. The Delivery Team may use the Scrum or Kanban agile framework.

Engineer responsibilities include:

  • Support the design and development of HPC and utility systems (computation, network, and storage)
  • Support AI Infrastructure and equivalent systems
  • Demonstrate an automation mindset, including the use of automation, AI, and orchestration tools and scripting languages. Examples include Ansible, PowerShell, and Terraform.
  • Perform full stack engineering, including platform support, user software support, and management of queuing software to meet the computing needs of research projects
  • Perform system administration on multiple system platforms and hardware, ranging from small servers to large supercomputers
  • Handle system installations, upgrades, configuration management, software installation, troubleshooting, user interface, and support
  • Participate in the required on-call support rotation

This role requires U.S. Citizenship. This position is full-time telecommuting. Occasional travel (1-3 times a year) may be requested.

What’s In It For You

From onsite to remote work, and from flexible work schedules to comprehensive benefits, we invest in your future and security.

Do you want to be part of a company culture that empowers employees to think big, lead with a growth mindset, and make the impossible a reality? We provide the resources and give you the flexibility to enable inspiration and focus. If you have the passion and courage to dream big, work hard, and have fun doing what you love, then we want to build a better tomorrow with you.

Requirements

  • Experience with hardware layer/engineering in the full stack
  • Demonstrated automation mindset, including the use of automation, AI, and orchestration tools and scripting languages. Examples include Ansible, PowerShell, and Terraform
  • Red Hat Enterprise Linux (RHEL) 7 or higher Administration and Configuration
  • US Citizenship required for this role
  • Experience with Kubernetes (foundational knowledge / experience)

Nice To Haves

  • Experience with High Performance Computing infrastructure product development and/or maintenance
  • Experience with AI infrastructure product development and/or maintenance
  • Experience with Splunk reporting
  • Experience using an agile management tool such as JIRA, VersionOne, or Pivotal Tracker
  • Experience with simulation and AI/ML software
  • Experience with DevOps / DevSecOps
  • Knowledge of various protocols (e.g., DNS, SMTP, NFS, FTP, Telnet, SSH, SFTP)
  • System performance, disk I/O, and network tuning and configuration experience
  • Experience in mitigating IT Tech Debt and retiring legacy products and services
  • Demonstrated use of metrics to make data-driven decisions
  • Familiarity with ServiceNow for ITSM
  • Familiarity with AWS and/or Azure IT service development and maintenance
  • Familiarity with private cloud on-premises IT service development and maintenance
  • Experience working in a virtual environment
  • Fibre Channel (Direct Attach) Storage Array Administration Experience
  • Experience with Trusted Multi-Level Security (MLS) Operating Systems
  • Familiarity with InfiniBand configuration and troubleshooting

Responsibilities

  • Support the design and development of HPC and utility systems (computation, network, and storage)
  • Support AI Infrastructure and equivalent systems
  • Demonstrate an automation mindset, including the use of automation, AI, and orchestration tools and scripting languages. Examples include Ansible, PowerShell, and Terraform.
  • Perform full stack engineering, including platform support, user software support, and management of queuing software to meet the computing needs of research projects
  • Perform system administration on multiple system platforms and hardware
  • Handle system installations, upgrades, configuration management, software installation, troubleshooting, user interface, and support
  • Participate in the required on-call support rotation

Benefits

  • Medical
  • Dental
  • Vision
  • Life Insurance
  • Short-Term Disability
  • Long-Term Disability
  • 401(k) match
  • Flexible Spending Accounts
  • EAP
  • Education Assistance
  • Parental Leave
  • Paid time off
  • Holidays

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 5,001-10,000 employees
