Platform Reliability Engineer

RTX
Aurora, CO
Hybrid

About The Position

The Platform Reliability Engineer will report into the Digital Infrastructure Services organization to design, implement, and maintain Raytheon's business-wide orchestration and container management platforms based on Kubernetes, in support of program software, solutions, and products. The primary responsibilities include implementing, supporting, and optimizing Kubernetes-based container orchestration platforms across unclassified and closed area systems. Additional responsibilities include partnering within the team and across functional, engineering, and program teams and personnel to optimize the use of the platforms and products, as well as bringing forward new ideas, concepts, and capabilities for future platform enhancements.

This role's responsibilities also include working directly with partners to diagnose and solve complex Kubernetes issues; implementing and improving observability and monitoring tools to provide error detection and defect elimination and to improve MTTD/MTTR, overall service availability, and customer satisfaction; and escalating larger issues to the Platform Engineers for further investigation and following the process to resolution.

Our Raytheon Orchestration and Container Kubernetes Service (ROCKS) team is responsible for providing a secure, standard, ROCKS-solid foundation for the Digital Ecosystem at Raytheon. ROCKS is a Platform Engineering team that provides an integrated container management platform built on Kubernetes and a set of managed services. ROCKS provides a modern cloud native container orchestration platform that can be installed on air-gapped classified information systems or leveraged as a non-production service in shared unclassified platforms across our networks.

Requirements

  • Typically requires a university degree or equivalent experience and a minimum of 5 years of prior relevant experience, or an advanced degree in a related field and a minimum of 3 years of experience.
  • Active and transferable U.S. government issued Secret security clearance is required prior to start date.
  • U.S. citizenship is required, as only U.S. citizens are eligible for a security clearance.
  • Experience installing, deploying, monitoring, and supporting Kubernetes clusters in on-premises and cloud infrastructure, working with Rancher RKE2, upstream Kubernetes, OpenShift Container Platform, VMware VKS/Tanzu, or other leading Kubernetes platforms.
  • Experience with Kubernetes-based development tools and languages such as Terraform, Helm, Python, Go, and Bash.
  • Experience working with observability and monitoring software such as Grafana, Prometheus, Alertmanager, and Loki.

Nice To Haves

  • Experience deploying new cloud native platforms and systems in classified and/or unclassified work environments.
  • Experience designing and operating highly scalable, secure, high performing systems, platforms, and Kubernetes clusters.
  • Experience working with VMware, AWS GovCloud, and Azure Government.
  • Experience contributing to and executing projects on time and on budget.
  • Experience translating business and functional demands into technical requirements and tasks.
  • Experience clearly documenting and diagramming technical systems.
  • Experience working with Cloud Native Computing Foundation (CNCF) Kubernetes components, including service mesh, service discovery, package management, observability and monitoring, runtimes, and security.
  • Experience with GitOps and with package management technologies for Kubernetes cluster management and operations, such as ArgoCD, Packer, Helm, and Kustomize.
  • Experience working with agile teams in a product-mode approach and partnering with product owners and scrum masters to align and execute work.
  • Experience implementing Kubernetes on air-gapped and regulated networks and environments.
  • Constant vigilance when investigating and finding the root cause of distributed system malfunctions.
  • Experience anticipating or adopting new innovations and advancements in the CNCF landscape to enhance the effectiveness of our products.

Responsibilities

  • Work autonomously and partner with Raytheon programs, engineers, and Digital Technology peers to understand their needs and solve complex problems.
  • Partner with teams managing infrastructure, networking, and application development to forecast capacity, scaling, and demand.
  • Implement and improve observability and monitoring tools to provide error detection and defect elimination and to improve MTTD/MTTR, overall service availability, and customer satisfaction.
  • Escalate larger issues to the Platform Engineers for further investigation and follow the process to resolution.

Benefits

  • medical
  • dental
  • vision
  • life insurance
  • short-term disability
  • long-term disability
  • 401(k) match
  • flexible spending accounts
  • flexible work schedules
  • employee assistance program
  • Employee Scholar Program
  • parental leave
  • paid time off
  • holidays