Platform Engineer, Kubernetes Platform Engineering (KPE Features)

Intercontinental Exchange Holdings, Inc.Atlanta, GA
Onsite

About The Position

The Kubernetes Platform Engineering (KPE) team builds and operates ICE's internal container orchestration platform powered by Red Hat OpenShift. KPE Features — the Solutions Engineering sub-team — serves as the primary interface between the platform and its consumers across ICE's various business units. We are looking for a Mid-Level Platform Engineer to join KPE Features. In this role, you will design, implement, and support platform capabilities that enable development teams to deploy and operate workloads at enterprise scale. You'll work across the full platform stack — from GitOps pipelines and policy enforcement to observability infrastructure — while acting as a technical resource for internal platform consumers.

Requirements

  • 3+ years of experience in platform engineering, DevOps, or a related discipline
  • Hands-on experience with Kubernetes in a production environment; OpenShift experience strongly preferred
  • Experience with Kubernetes configuration and templating tools such as Kustomize or Helm
  • Comfort operating in Linux environments and working with YAML-heavy configuration management
  • Strong written communication skills — this team writes documentation and communicates changes to a broad consumer base
  • Deep proficiency in at least one of the following disciplines, with general familiarity across the others: K8s Observability — Grafana dashboards, Prometheus alerting pipelines, distributed tracing (Tempo, Jaeger), log aggregation (Loki, Vector, Cortex), and related logging/monitoring practices
  • Deep proficiency in at least one of the following disciplines, with general familiarity across the others: K8s GitOps & Release Engineering — GitHub, GitHub Actions, ArgoCD, Kustomize, Helm, and Git-based delivery workflows
  • Deep proficiency in at least one of the following disciplines, with general familiarity across the others: K8s Infrastructure Engineering — Kubernetes on bare metal, cluster networking (CNI, OVN-Kubernetes, network policy), storage and CSI drivers, and hardware-layer operations
  • Deep proficiency in at least one of the following disciplines, with general familiarity across the others: K8s Security & Policy Enforcement — Policy-as-code tooling (Kyverno, OPA/Gatekeeper), admission webhooks, and container security practices

Nice To Haves

  • Red Hat OpenShift Certified Administrator (EX280) or equivalent certification
  • Experience supporting internal platform consumers or operating in a shared-services model
  • Ansible and Red Hat Ansible Automation Platform (AAP)
  • Red Hat Advanced Cluster Management (ACM)
  • Red Hat Advanced Cluster Security (ACS)
  • Experience with HyperShift or OpenShift Hosted Control Planes
  • Podman and container image build/management workflows
  • Experience developing custom Kubernetes operators

Responsibilities

  • Deliver platform features and enhancements, working from design through deployment and documentation
  • Support onboarding of new teams and workloads onto the platform via self-service and GitOps-based provisioning workflows
  • Maintain and evolve ArgoCD-based GitOps pipelines and Kustomize overlay configurations across multiple clusters
  • Implement and tune Kyverno admission control policies to enforce security and operational standards across the platform
  • Build and maintain observability tooling — including Grafana dashboards, Tempo distributed tracing, and alerting pipelines — to give platform consumers and the KPE team meaningful operational insight
  • Collaborate with KPE-Infrastructure and KPE-Enablement sub-teams on cluster lifecycle events, upgrades, and cross-cutting platform concerns
  • Engage with platform consumers and stakeholders to understand business requirements and translate them into scalable technical solutions
  • Contribute to platform documentation, runbooks, and internal knowledge bases on Confluence
  • Participate in incident response and root cause analysis for platform-layer issues
  • Act as third-level escalation support for complex platform-layer incidents
  • Participate in an on-call rotation to ensure platform availability and rapid response to critical issues

Benefits

  • healthcare coverage (medical, dental and vision)
  • a 401(k) plan
  • life insurance
  • time off
  • paid leave for qualifying circumstances

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

1-10 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service