About The Position

This role will support one or more direct or indirect contracts with the U.S. Federal Government which, due to federal government security requirements, mandates that all Workday personnel working on the contracts be United States citizens (naturalized or native). The Workday Sana Search Team is responsible for creating the world’s most powerful, platform-agnostic enterprise search product, transforming the way people and agents interact with knowledge, inside and outside the Workday ecosystem. We are in the process of building the definitive discovery platform: a standalone-ready, hybrid-by-design service that orchestrates enterprise data at scale. From federated gateways to agentic text retrieval pipelines and open personalization frameworks, we provide the blueprints for modern search, engineered for universal portability and precision. Joining our team means embarking on a journey of opportunity to advance your career and contribute to impactful solutions that shape industries. Whether you thrive with solving sophisticated business problems, collaborating with agile teams, or championing innovation and software design, Workday offers an environment where your talents can thrive.

Requirements

  • 5+ years of experience in DevOps, Site Reliability Engineering, or Platform Engineering.
  • 5+ years of experience with AWS (Compute, Storage, Networking, and Control Plane).
  • Must be a U.S. Citizen (required for Federal Government contract compliance).

Nice To Haves

  • Experience managing production workloads in Kubernetes.
  • Deep familiarity with CI/CD tools and IaC frameworks.

Responsibilities

  • Provision and manage AWS resources (EC2, Lambda, ElastiCache, S3, RDS) using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
  • Build platforms and tools that empower application developers to interact with production in a self-service manner.
  • Manage Docker images and Kubernetes manifests (using Kustomize/Helm) to support and scale microservices.
  • Define, design, implement, test, and deploy automation infrastructure for configuration management and service deployment to improve operational efficiency.
  • Support and troubleshoot CI/CD pipelines (e.g., Jenkins, TeamCity, Argo CD), ensuring builds are fast and deployments are reliable.
  • Drive the "commit to production" workflow, automating manual touchpoints where reasonable to help scale the team.
  • Configure CloudWatch, Prometheus, and ELK dashboards to ensure team visibility into system health.
  • Triage, fix, and resolve issues identified by production monitoring.
  • Conduct retrospectives and act on incidents to continually improve systems.
  • Participate in an infrequent on-call rotation to ensure high availability for critical systems.
  • Build and maintain strong relationships with peers and partners; work closely with developers to debug environment-specific issues and optimize application performance.
  • Maintain clear, concise documentation for deployment processes, infrastructure diagrams, and reliability practices.
  • Engage in a culture of learning and innovation through hackathons, online course offerings, and employee-led special interest guilds.

Benefits

  • Workday Bonus Plan or a role-specific commission/bonus
  • Annual refresh stock grants
  • Comprehensive benefits
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service