Senior CloudOps Engineer - Life Sciences

Capgemini
Dallas, TX
Onsite

About The Position

Capgemini is seeking a highly skilled Senior CloudOps Engineer for an onsite role in Plano, TX to design, build, and operate resilient, scalable, and secure multi-cloud platforms across Azure and Google Cloud. This role blends deep cloud infrastructure expertise with modern DevOps/SRE practices and an emerging focus on applying AI and large language models (LLMs) to cloud operations, automation, diagnostics, and observability. The ideal candidate is a self-starter who owns initiatives end-to-end, applies an automation-first mindset, and partners closely with engineering and product teams to drive platform reliability, performance, and continuous improvement.

Requirements

  • 3+ years of experience operating production-grade cloud platforms in Azure and Google Cloud
  • Strong hands-on experience with Kubernetes, including AKS and GKE
  • Advanced experience with Infrastructure as Code using Terraform, including module design and reuse
  • Solid understanding of cloud networking concepts (VPC/VNET design, routing, peering, VPNs, DNS)
  • Experience implementing zero-downtime deployment and upgrade strategies
  • Strong automation skills using Python, Bash, and/or PowerShell
  • Experience building and maintaining CI/CD pipelines
  • Deep familiarity with monitoring, logging, alerting, and observability tools
  • Demonstrated ability to troubleshoot complex infrastructure and performance issues
  • Working knowledge of SRE and DevOps principles and operational best practices
  • Excellent communication skills with the ability to collaborate across teams
  • Self-starter mindset with a track record of owning initiatives from design through operation

Nice To Haves

  • Interest in or hands-on experience applying AI/LLMs to operational workflows is strongly preferred

Responsibilities

  • Apply LLMs and AI-driven techniques to cloud operations, automation, diagnostics, and observability
  • Stay current with emerging AI-enabled DevOps and SRE tools and practices
  • Maintain overall cloud operational health, including uptime, performance, and stability, across Azure and Google Cloud environments
  • Design, build, deploy, and operate Kubernetes platforms (AKS and GKE) with zero-downtime deployment and upgrade strategies
  • Design and manage multi-cloud infrastructure using Terraform
  • Create and maintain reusable, secure, and scalable Terraform modules
  • Design, implement, and troubleshoot cloud networking, including VNETs/VPCs, routing, peering, VPNs, and DNS
  • Diagnose and resolve complex network, connectivity, performance, and platform issues
  • Automate infrastructure provisioning, deployments, and operational tasks using Python, Bash, or PowerShell
  • Build, maintain, and improve CI/CD pipelines and operational workflows
  • Implement monitoring, logging, alerting, and observability solutions to proactively identify issues
  • Perform root cause analysis and drive remediation and prevention strategies
  • Ensure platforms are scalable, resilient, secure, and performant by design
  • Develop and maintain a deep understanding of platform architecture, dependencies, and system interactions
  • Drive operational best practices, standards, and continuous improvement initiatives
  • Collaborate closely with engineering and product teams to support delivery goals
  • Act as a technical leader with strong communication, ownership, and problem-solving skills

Benefits

  • Paid time off based on employee grade (A-F), as defined by policy: vacation (12-25 days, depending on grade), company-paid holidays, personal days, and sick leave
  • Medical, dental, and vision coverage (or provincial healthcare coordination in Canada)
  • Retirement savings plans (e.g., 401(k) in the U.S., RRSP in Canada)
  • Life and disability insurance
  • Employee assistance programs
  • Other benefits as provided by local policy and eligibility