Engineer III, Platform Engineering

OmnicellCranberry Township, PA
Remote

About The Position

As an Engineer III on the Platform Infrastructure team, you are a hands-on individual contributor who builds, improves, and operates cloud-native infrastructure that supports Omnicell’s Cloud Platform. This role emphasizes independent execution, strong ownership, and close collaboration with development teams. The Engineer III, Platform Engineering will design, implement, and troubleshoot cloud infrastructure supporting the Omnisphere Cloud Platform, build and evolve infrastructure-as-code using Terraform and related automation tools, and design, enhance, and implement CI/CD pipelines that enable reliable, repeatable deployments. This role also involves partnering closely with application teams to ensure services are production-ready, scalable, and secure, and continuously improving platform reliability, performance, and operational efficiency. Additionally, the role includes participating in incident response, leading or contributing to root cause analysis and corrective action planning, and improving observability through alerting, dashboards, and modern monitoring practices. Support for on-call rotations and driving a culture of operational excellence are also key aspects. The role also involves actively participating in cloud cost optimization and FinOps practices, contributing to regular cost reviews, and identifying opportunities to improve efficiency. Using data and tooling to balance reliability, performance, and cost is essential. Finally, the role requires applying essential AI literacy to daily engineering work, including AI-assisted coding, automation, and troubleshooting, and leveraging AI and AIOps techniques to improve operational outcomes, anomaly detection, and developer productivity.

Requirements

  • 5+ years of experience in software engineering
  • Extensive experience with infrastructure deployment on AWS
  • Deep knowledge of infrastructure automation tools such as Terraform & Terragrunt
  • Proficiency in Python or an object-oriented programming language for automation & tooling
  • Extensive knowledge of containerization
  • Strong experience with Kubernetes administration and troubleshooting using AWS EKS
  • Experience with best practices for deploying, running, and observing workloads in Kubernetes and Helm
  • Essential AI literacy, including core programming and AIOps practices for troubleshooting resources
  • Experience with tools such as Kafka & PostgreSQL
  • Deep knowledge of Linux administration
  • Extensive experience with CI/CD workflows and best practices
  • Experience with GitOps tools such as ArgoCD
  • Experience with infrastructure optimization and cost management
  • Experience within incident management processes, including incident commander role, directing root cause analysis, and on-call rotations
  • Experience with modern observation techniques and platforms, such as OpenTelemetry and AWS CloudWatch
  • Knowledge with NiFi and ClickHouse databases
  • Experience in FinOps best practices

Responsibilities

  • Design, implement, and troubleshoot cloud infrastructure supporting the Omnisphere Cloud Platform
  • Build and evolve infrastructure-as-code using Terraform and related automation tools
  • Design, enhance, and implement CI/CD pipelines that enable reliable, repeatable deployments
  • Partner closely with application teams to ensure services are production-ready, scalable, and secure
  • Continuously improve platform reliability, performance, and operational efficiency
  • Participate in incident response, including serving as incident commander when needed
  • Lead or contribute to root cause analysis and corrective action planning
  • Improve observability through alerting, dashboards, and modern monitoring practices
  • Support on-call rotations and help drive a culture of operational excellence
  • Actively participate in cloud cost optimization and FinOps practices
  • Contribute to regular cost reviews and identify opportunities to improve efficiency
  • Use data and tooling to balance reliability, performance, and cost
  • Apply essential AI literacy to daily engineering work, including AI-assisted coding, automation, and troubleshooting
  • Leverage AI and AIOps techniques to improve operational outcomes, anomaly detection, and developer productivity
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service