Modeling & Simulation Orchestration/Kubernetes Engineer

Torch Technologies, Inc.Huntsville, AL
Onsite

About The Position

The Modeling & Simulation (M&S) Orchestration/Kubernetes Engineer supports testing of complex systems within distributed Hardware-in-the-Loop (HWIL) and cloud-based simulation environments for the Missile Defense Agency (MDA). This role focuses on containerization, orchestration, and DevOps practices to ensure scalable, secure, and reliable deployment of simulation and test applications in support of mission-critical software development and test events.

Requirements

  • U.S. Citizenship
  • Bachelor’s degree in Computer Science, Engineering, Information Systems, or related field (equivalent professional experience considered)
  • 2+ years of related experience.
  • Active SECRET security clearance with ability to obtain and maintain TS/SCI
  • Experience supporting Kubernetes-based environments
  • Experience with Docker and container orchestration (Kubernetes, EKS, GKE, or OpenShift)
  • Experience developing or maintaining CI/CD pipelines
  • Experience with Infrastructure as Code (Terraform, Ansible, or similar)
  • Strong scripting skills (Python, Bash, or similar)
  • Knowledge of Linux/UNIX environments and command-line tools
  • Experience with Git and collaborative development workflows

Nice To Haves

  • Certified Kubernetes Administrator (CKA) or similar certification
  • Experience working in cloud environments (AWS, Azure, or GCP)
  • Experience supporting distributed or high-performance computing environments
  • Familiarity with GPU-enabled workloads and CUDA architecture
  • Experience integrating ML/AI workloads into containerized environments
  • Experience building or supporting real-time or streaming data pipelines
  • Familiarity with monitoring and observability best practices
  • Experience with Linux development environments
  • Experience with Docker and advanced container security practices
  • Familiarity with large-scale data processing or distributed systems frameworks

Responsibilities

  • Design, build, and maintain scalable, resilient Kubernetes clusters for simulation and test environments
  • Deploy and manage containerized applications using Docker and Kubernetes, ensuring high availability and performance
  • Develop and automate CI/CD pipelines using tools such as Jenkins, GitLab CI, or Azure DevOps
  • Implement Infrastructure as Code (IaC) using Terraform, Ansible, or similar tools to provision and manage cloud and on-prem resources
  • Monitor cluster health, system performance, and application metrics using tools such as Prometheus, Grafana, and Splunk
  • Troubleshoot infrastructure and application issues in real-time during test events
  • Collaborate with development teams to streamline containerization and promote DevOps best practices
  • Implement and maintain security controls including network policies, RBAC, vulnerability scanning, and compliance enforcement
  • Support distributed simulation events, including occasional off-hours test activities

Benefits

  • ESOP participation
  • 401(k) match and safe-harbor contribution
  • medical, dental, vision, life insurance
  • short-term disability
  • long-term disability
  • flexible spending accounts
  • Health Saving Accounts and Health Reimbursement Accounts
  • EAP
  • education assistance
  • paid time off
  • holidays
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service