DevOps Software Engineer Level 3

Praxis EngineeringAnnapolis Junction, MD
36d

About The Position

The DevOps Software Engineer shall be responsible for the Operational and Maintenance (O&M) efforts including installation, configuration, integration, monitoring, and sustaining of a large multi-tenant containerized Kubernetes High Performance Computing as a service (HPCaaS) platform for a large Linux computing environment.

Requirements

  • Experience using the Linux CLI
  • Experience developing and maintaining scripts using Bash/Python
  • Experience developing with Python and Java in a Linux environment
  • General HPC technical knowledge regarding compute, network, memory, and storage system components
  • Experience installing, configuring, and supporting COTS/GOTS/FOSS software, libraries, and packages in a Linux environment
  • Experience with containerization technologies such as Docker and containerd
  • Experience with container orchestration technologies such as Kubernetes
  • Experience administering Kubernetes clusters on bare metal in a Linux environment
  • Experience with IaC (Infrastructure as Code) concepts, principles and automation tools such as Ansible and Terraform
  • Experience with CI/CD principles, methodologies, and tools such as GitLab CI
  • Experience with Git Version Control System
  • Master’s degree in computer science or related discipline from an accredited college or university, plus five (5) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity.
  • Bachelor’s degree in computer science or related discipline from an accredited college or university, plus seven (7) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity
  • Nine (9) years of experience as a SWE, in programs and contracts of similar scope, type, and complexity.
  • Active TS/SCI with an appropriate polygraph is required to be considered for this role

Nice To Haves

  • Familiar with Site Reliability Engineering (SRE) principles and applications
  • Experience with the Atlassian Tool Suite (JIRA, Confluence)
  • Experience using system monitoring tools such as Grafana/Prometheus

Responsibilities

  • Operational and Maintenance (O&M) efforts including installation
  • configuration
  • integration
  • monitoring
  • sustaining of a large multi-tenant containerized Kubernetes High Performance Computing as a service (HPCaaS) platform for a large Linux computing environment.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service