We’re looking for a Cloud DevOps & AI Ops Engineer who fully owns the infrastructure and operational lifecycle for our platform — from code deployment to production AI systems. You take end-to-end responsibility for how systems are built, deployed, scaled, and maintained in production. This is not a maintenance-only role. You will: Diagnose issues across cloud infrastructure, data pipelines, and AI systems Design and operate CI/CD pipelines for fast, reliable releases Build and manage scalable infrastructure on GCP using Terraform Implement and support AI-powered workflows, including LLMs and agent-based systems Monitor, debug, and optimize production systems across infrastructure and AI workloads You are both the infrastructure architect and the hands-on engineer, ensuring our systems — including AI — run reliably in production. This is a high-impact hire. You’ll define how infrastructure and AI systems operate at scale — establishing best practices, building automation, and shaping how engineering teams leverage AI in production. This role is based in Salt Lake City and reports to the VP of Engineering. What Success Looks Like In Your First 30 Days Audit existing infrastructure, CI/CD pipelines, and deployment workflows Understand current data pipelines, ML/LLM usage, and AI workflows Identify reliability risks, bottlenecks, and gaps in automation and observability Document system architecture and operational standards Propose improvements to increase stability, speed, and AI system reliability In Your First 90 Days Improve CI/CD pipelines to enable faster, safer deployments Deploy and manage infrastructure using Terraform and GCP best practices Implement monitoring and alerting across infrastructure and AI systems Support and productionize AI workflows and LLM-powered features Reduce manual work through automation and reusable tooling In Your First Year Build a scalable, repeatable infrastructure and AI operations framework Improve uptime, deployment frequency, and system reliability Establish DevOps and AI Ops best practices across engineering Enable reliable deployment of AI systems and agent workflows Serve as the go-to expert for infrastructure, performance, and AI system operations
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed