Lead Site Reliability Administrator

Open Text Inc., Alpharetta, GA

About The Position

OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that shape the future of digital transformation.

AI-First. Future-Driven. Human-Centered. At OpenText, AI is at the heart of everything we do: powering innovation, transforming work, and empowering digital knowledge workers. We are hiring talent AI can't replace to help us shape the future of information management.

An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. At OpenText, the Site Reliability Engineer position is part of the technical team and involves complete ownership of containerized platform delivery, including administration and management of the OpenText containerized infrastructure stack for our worldwide customers, both on-premises and on public hyperscalers.

You will be joining a growing team that provides world-class operational support, including hands-on troubleshooting and administration, to a variety of enterprise customers. You will be required to collaborate with cross-functional teams to ensure that service levels are met and customer satisfaction is achieved. This hands-on role will focus on designing, maintaining, and troubleshooting software features and solutions, as well as hardening processes for cloud environments and container-based applications.

The successful candidate will be responsible for ensuring that industry best practices and methodologies are applied to the design, deployment, and operation of our private and public cloud infrastructure. This position identifies security vulnerabilities, maintenance issues, and gaps in monitoring and observability, as well as opportunities for improvement.

This role requires deep technical knowledge of container technologies and a solid understanding of the cloud offerings of the major hyperscalers as well as on-premises infrastructure.

Requirements

  • BS/MS in Computer Science or related field; GCP or AWS certifications preferred.
  • Experience with virtualization technologies (e.g., VMware/OpenStack) and enterprise storage (e.g., NetApp).
  • Hands-on experience with GitOps tooling such as ArgoCD, GitLab CD, ACM, Tekton, etc.
  • Proven experience automating infrastructure via code.
  • 5+ years of hands-on software engineering, SRE, or cloud operations experience focusing on cloud and container platforms.
  • Proficiency in scripting languages: Python, Bash, or PowerShell.
  • Knowledge of Go is a strong plus.
  • Expert-level experience with Ansible and Terraform.
  • Demonstrated ability to design and implement DevOps toolchains in large, complex organizations.
  • Experience running Kubernetes across multiple flavors: kubeadm, GKE, EKS, BOSH CFCR, Cluster API, and other enterprise distributions.
  • Experience with Anthos / Tanzu is appreciated.
  • Strong background working with Linux environments.
  • Ability to communicate effectively with technical and non-technical stakeholders across all organizational levels.

Nice To Haves

  • Experience with service mesh technologies (Istio, Linkerd).
  • Knowledge of policy-as-code frameworks (OPA/Gatekeeper, Kyverno).
  • Hands-on experience with multi-cluster or multi-cloud networking (Cilium, Calico).
  • Building or operating Kubernetes operators and custom CRDs.
  • Familiarity with FinOps or cloud cost optimization for container workloads.
  • Experience with backup/restore and disaster recovery tooling (Velero, Restic).
  • Exposure to security hardening and compliance frameworks relevant to cloud platforms.
  • Knowledge of GKE, AKS, and EKS.
  • Experience with on-premises orchestrators such as Tanzu and Anthos.
  • We are shifting from event-based to proactive, log-based monitoring, and deep experience here is highly appreciated.

Responsibilities

  • Running and supporting applications on containerized infrastructure across private datacenters and public cloud platforms.
  • Planning, testing, and implementing monitoring, alerting, and observability for containerized services.
  • Deploying, managing, and optimizing container security solutions (e.g., Prisma Cloud / RedLock / Twistlock).
  • Building automation and security solutions that enable DevSecOps CI/CD pipelines.
  • Working with core cloud-native technologies: Kubernetes, Docker, GitOps, CI/CD, IaC, PaaS.
  • Operating and designing solutions across hyperscalers: GCP, AWS, Azure.
  • Creating and maintaining standards, policies, procedures, and best-practice documentation.
  • Automating workflows to align with ITIL, compliance, and internal change management processes.
  • Leading platform initiatives, influencing engineering practices, and mentoring engineers.

Benefits

  • At OpenText, we offer a thoughtfully designed benefits package that supports your physical, emotional, and financial wellbeing.
  • Vacation entitlement.
  • Paid time off.