The AI Cloud Infrastructure Engineer owns the cloud infrastructure, environment architecture, compute management, networking, and platform operations that enable the Forge to build, deploy, scale, and operate AI and agentic systems in production with enterprise-grade reliability, security, and governance. This is a hands-on senior infrastructure engineering role. The engineer designs and operates the cloud environments, container platforms, networking layers, identity boundaries, deployment pipelines, and runtime infrastructure that AI and agentic workloads depend on. Azure is the primary cloud, with support for AWS and Google Cloud where specific AI services or workload requirements warrant multi-cloud deployment. Daily work includes provisioning and managing cloud environments, designing and maintaining container orchestration platforms, building Infrastructure as Code, managing compute and GPU resources for AI workloads, configuring networking and environment isolation, operating CI/CD deployment infrastructure, implementing identity and access controls at the infrastructure layer, instrumenting observability and telemetry, optimizing cost and performance, and ensuring all infrastructure meets Forge security, governance, and operational standards. This role is the foundation that everything else in the Forge runs on. If the infrastructure is wrong, nothing built on top of it will be reliable, secure, or scalable.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Principal
Number of Employees
5,001-10,000 employees