Infrastructure Engineer

ShakudoMenlo Park, CA

About The Position

Shakudo is building the world's first operating system for data and AI. We are seeking an Infrastructure Engineer to join our Business Automation team to own and operate the internal systems, infrastructure, and AI Gateway product that power Shakudo at scale. This is a hands-on role for someone who thrives on keeping production systems reliable, secure, and fast. You will be responsible for everything from physical servers and DGX machines to CI/CD pipelines and customer-facing AI Gateway infrastructure. You will also contribute directly to product hardening, security, and DevOps practices across the platform.

Requirements

  • 8+ years of experience across software, data, platform, or AI engineering roles
  • 5+ years of strong experience with Kubernetes cluster operation and DevOps
  • Proficiency in Rust
  • Experience operating production infrastructure at scale, including physical servers, GPU clusters, and CI/CD systems
  • Strong background in security hardening, observability, and reliability engineering
  • Experience with AI/ML infrastructure, including LLM hosting and inference serving

Responsibilities

  • Maintain and operate internal services for the rest of the Shakudo employees, including proprietary applications for sales and ETL pipelines
  • Maintain and operate DGX machines that host LLMs for the team's use
  • Maintain and operate Shakudo's product for Shakudo's internal use, and contribute to product hardening, security, and DevOps practices
  • Maintain and operate physical servers for Kubernetes clusters and ensure uptime
  • Create CI/CD pipelines for internal deployments
  • Maintain and operate the AI Gateway product for customers, ensure uptime, and contribute to product roadmap
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service