Senior DevSecOps / Platform Engineer - Agentic AI

Peraton - Annapolis Junction, MD
$112,000 - $179,000

About The Position

Peraton Labs is seeking a Senior DevSecOps / Platform Engineer to own the Agentic AI platform end-to-end across delivery, infrastructure, runtime operations, and platform health. This role will be responsible for building and operating the technical backbone that enables secure, reliable, and scalable platform delivery, from CI/CD pipelines and automated testing through EKS cluster operations, AWS infrastructure, monitoring, alerting, and synthetic health checking. The ideal candidate combines deep technical execution with strong operational judgment and a builder mindset, with the ability to design systems that are not only functional, but supportable, observable, secure, and audit-ready.

This is an opportunity to own the platform layer of a highly visible and ambitious Agentic AI environment. You will work on meaningful problems that require strong technical judgment, operational rigor, and modern engineering practices. For the right candidate, this role offers the chance to build real systems, improve platform resilience, and directly influence how advanced AI-enabled capabilities are delivered in secure, compliant, and production-ready ways.

Requirements

  • Minimum of a BS with 8+ years of experience, an MS with 6+ years of experience, or a PhD with 3+ years of experience in platform engineering, DevOps, SRE, or closely related infrastructure engineering roles
  • Deep CI/CD experience with GitHub Actions and/or GitLab CI
  • Strong hands-on background operating Amazon EKS / Kubernetes environments
  • Senior-level AWS experience across core services including VPC, IAM, S3, RDS, CloudTrail, and KMS
  • Strong experience building and operating monitoring, logging, and alerting solutions at scale using CloudWatch, Prometheus/Grafana, or equivalent
  • Hands-on infrastructure-as-code experience using OpenTofu / Terraform
  • Experience supporting agile delivery models, including trunk-based development, MR-driven change, and automated quality gates
  • Strong security and compliance engineering foundation, including secrets management, least privilege, supply chain integrity, and audit evidence support
  • Experience implementing automated testing in CI/CD pipelines
  • Experience running DAST and security testing against live services in pipeline workflows, including triage and gating of findings alongside SAST/SCA results
  • Experience building synthetic and transactional monitoring, including scripted health checks that validate real auth flows and critical transactions
  • Strong troubleshooting, systems thinking, and operational ownership mindset
  • US Citizenship is a requirement for this position

Nice To Haves

  • Experience implementing Big Bang / Platform One
  • Familiarity with Iron Bank, Sigstore/Cosign, and hardened image pipeline practices
  • Prior experience supporting systems pursuing or maintaining an Authority to Operate (ATO)
  • Experience with contract testing tools such as Pact
  • Experience with chaos engineering concepts or tooling
  • Experience with progressive delivery approaches such as canary or blue/green deployments with automated rollback based on health-check failure
  • Experience working in regulated or mission environments aligned to NIST 800-171, CMMC, FedRAMP, or DoD security expectations
  • Background helping mature platform reliability and security in cloud-native environments supporting sensitive workloads

Responsibilities

  • Own and operate the Agentic AI platform end-to-end across CI/CD, cloud infrastructure, Kubernetes operations, observability, and runtime reliability
  • Design, build, and maintain CI/CD pipelines across GitHub Actions and/or GitLab CI, enabling secure, repeatable, and efficient delivery workflows
  • Implement and improve automated testing and quality gates within the software delivery lifecycle, including build validation, integration checks, security testing, and deployment controls
  • Lead operational ownership of Amazon EKS / Kubernetes environments, including cluster lifecycle management, upgrades, troubleshooting, RBAC, Helm-based deployments, pod identity, and GitOps-aligned workflows
  • Build and maintain AWS infrastructure supporting platform and application needs across services such as VPC, IAM, S3, RDS, CloudTrail, and KMS
  • Own infrastructure provisioning and lifecycle management through OpenTofu / Terraform and related infrastructure-as-code practices
  • Design and operate a mature monitoring, logging, and alerting stack using CloudWatch, Prometheus/Grafana, or equivalent tooling
  • Develop actionable alerts, service health indicators, and SLO-aligned operational thresholds that support fast triage and resilient service delivery
  • Build and maintain synthetic and transactional monitoring that exercises real authentication flows, user journeys, and critical service transactions
  • Implement and maintain DAST and runtime security testing in delivery pipelines using tools such as OWASP ZAP, Burp, Nuclei, or equivalent
  • Support security-focused CI/CD practices including secrets management, least privilege, software supply chain integrity, and audit evidence generation
  • Drive modern engineering delivery patterns such as trunk-based development, merge-request-driven change, and automated release quality controls