Bare Metal Kubernetes Engineer

EverpureSanta Clara, CA
4hOnsite

About The Position

We’re in an unbelievably exciting area of tech and are fundamentally reshaping the data storage industry. Here, you lead with innovative thinking, grow along with us, and join the smartest team in the industry. This type of work—work that changes the world—is what the tech industry was founded on. So, if you're ready to seize the endless opportunities and leave your mark, come join us. You will architect the backbone of Everpure’s engineering excellence by designing and operating massive-scale bare-metal Kubernetes environments. As a senior leader within Infrastructure Shared Services (ISS), you’ll bridge the gap between hardware and high-performance software, ensuring our global R&D teams have the reliable, secure platforms needed to build world-class products. This is a high-impact mission where your expertise in on-prem automation and cluster orchestration directly accelerates our product innovation.

Requirements

  • Kubernetes Mastery: Deep technical command of Kubernetes internals (API server, etcd, scheduler) and a proven track record of managing production clusters on bare-metal infrastructure.
  • Infrastructure Automation & Code: Proficiency in driving "Infrastructure as Code" using tools like Ansible or Terraform, combined with the ability to build custom tooling or integrations in Go or Python.
  • Advanced Networking & Linux Systems: Expert-level knowledge of Linux systems administration and container networking, including hands-on experience with CNI plugins, BGP, and spine-leaf architectures.
  • Observability & Reliability Mindset: Experience building comprehensive monitoring and logging frameworks (ELK, Prometheus) and a commitment to root-cause analysis and incident prevention.
  • Technical Leadership: The ability to translate complex business requirements into scalable technical designs and communicate those strategies effectively to both stakeholders and peers.

Responsibilities

  • Architect Production-Grade Ecosystems: Lead the design and deployment of large-scale bare-metal clusters, integrating control planes with Portworx and Pure Storage arrays to deliver high-performance persistent storage.
  • Scale Network Infrastructure: Own the implementation of advanced cluster networking (Cilium, BGP, L4/L7 load balancing) to ensure seamless, low-latency communication across multi-rack and multi-site topologies.
  • Drive Operational Excellence via GitOps: Build and maintain automated, self-healing workflows using ArgoCD and CI/CD pipelines to manage cluster lifecycles, ensuring zero-touch deployments and consistent platform health.
  • Guarantee Reliability and Security: Define and meet rigorous SLIs/SLOs by engineering robust observability stacks (Prometheus, Grafana) and enforcing airtight security through RBAC, OIDC, and network isolation.
  • Collaborate and Mentor: Partner with internal business units to onboard complex workloads—like KubeVirt or GitHub Actions runners—while elevating the technical bar for the team through design reviews and mentorship.

Benefits

  • Pure Innovation: We celebrate those who think critically, like a challenge, and aspire to be trailblazers.
  • Pure Growth: We give you the space and support to grow along with us and to contribute to something meaningful. We have been named Fortune's Best Workplaces in Technology™, Fortune's Best Workplaces in the Bay Area™, and certified as a Great Place to Work®!
  • Pure Team: We build each other up and set aside ego for the greater good.
  • flexible time off
  • wellness resources
  • company-sponsored team events
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service