AI & HPC Infrastructure Architect

AccentureIrving, TX
35d

About The Position

The Global Infrastructure Engineering AI & HPC team is at the center of enabling infrastructure reinvention for the next era of digital solutions powered by AI and High-Performance Computing (HPC). We bring together deep technical expertise across cloud, on-prem, and hybrid environments to design, build, and operate accelerated infrastructure that powers high-performance workloads at scale. Our solutions enable some of our most strategic and mission-critical clients to unlock new levels of performance, efficiency, and innovation. Our remit spans the full lifecycle-from strategy and architecture through implementation and operations-driving modernization across the entire infrastructure stack. We collaborate across the ecosystem to harness emerging technologies, fuel growth, and transform industries. In this rapidly growing market, our team is leading the way in shaping how enterprises leverage AI and HPC to drive breakthrough innovation and reimagine what's possible in infrastructure.

Requirements

  • Minimum 4+ years' experience advising and engaging with C-Suite executives and senior leadership, translating complex AI and HPC technologies into business and strategic value
  • Minimum 4+ years of experience with infrastructure components including XPUs, high-performance fabrics (InfiniBand, Ethernet), and modern storage/data platforms (e.g. NVMe-oF, Lustre, BeeGFS, VAST, DDN, Weka)
  • Minimum 4+ years' experience with orchestration and management frameworks (Slurm, Kubernetes, Docker) and performance/monitoring tools for AI/HPC environments
  • Minimum 4+ years' experience of MLOps, DevSecOps, and automation principles (Terraform, Ansible) as they apply to large-scale, secure, and reproducible workflows
  • Bachelor's degree or equivalent (minimum 12 years) work experience. (If Associate's Degree, must have minimum 6 years work experience)

Nice To Haves

  • Experience advising or overseeing large-scale AI/HPC deployments (1,000+ GPUs or clusters of 100+ servers), providing architecture and strategic guidance
  • Familiarity with GPU computing and accelerator ecosystems (NVIDIA CUDA, AMD ROCm) and integration considerations for HPC/AI workloads
  • Knowledge of AI/ML frameworks (TensorFlow, PyTorch) and their operational and performance implications in HPC/AI environments
  • Industry experience in Life Sciences, Resources, Automotive, Financial Services, Telecommunications, or other HPC/AI-intensive sectors
  • Relevant cloud or infrastructure certifications (e.g., AWS Solutions Architect, GCP Professional Data Engineer) or equivalent technical credentials
  • Experience in workload planning, optimization, and orchestration guidance to align infrastructure with business and research objectives

Responsibilities

  • Define and architect AI and HPC infrastructure strategies, selecting the right on-premises, cloud, and hybrid platforms, tools, and workflows to support diverse workload requirements
  • Develop end-to-end architecture roadmaps tailored to industry or functional needs, aligning business objectives with technical capabilities
  • Lead client workshops, assessments, and training sessions to guide adoption of AI/HPC technologies, best practices, and operating models
  • Provide thought leadership on emerging technologies (accelerators, composable systems, orchestration frameworks) to inform client strategies and enhance future-state designs

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Professional, Scientific, and Technical Services

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service