VP, AI Infrastructure - Highrise.ai

Hut 8 - San Francisco, CA
1d - Remote

About The Position

Hut 8 is scaling a GPU platform that integrates power, colocation, and compute into a single, operationally owned stack. Customers don’t ask whether we have power or GPUs; they ask who actually runs the operation. This role exists to be that answer. Hut 8 leadership owns strategy, capital, and customer relationships. You own execution. You are accountable for taking raw, power-backed infrastructure and turning it into production-ready, SLA-backed GPU capacity at scale. This is not a lab role or a traditional enterprise ops role. It is enterprise-grade operational discipline applied to hyperscale infrastructure, built on direct control of power, facilities, and compute.

Requirements

  • 10+ years in large-scale infrastructure or hyperscale data center operations
  • 5+ years operating GPU-accelerated and/or HPC environments
  • Direct experience deploying and operating 10,000+ GPUs in managed, production settings
  • Deep expertise in RDMA networking (InfiniBand and/or RoCE)
  • Proven ownership of 99.9%+ uptime and customer-facing SLAs
  • Hands-on operator mindset — you drive issues from detection → root cause → resolution
  • Track record of building repeatable deployment and commissioning playbooks
  • Experience leading teams of 20–50+ across field ops, deployment, and infrastructure
  • Senior leadership background (Director / Senior Director / VP) at a recognized operator

Nice To Haves

  • Direct working relationships with NVIDIA and/or AMD
  • Experience with H100, H200, GB200, and/or MI300X platforms
  • Multi-site, parallel deployment experience
  • Background spanning greenfield buildouts and steady-state hyperscale operations

Responsibilities

  • Deployment, commissioning, and production sign-off
  • Performance, uptime, and SLA ownership
  • Incident response, escalation, and root cause resolution
  • OEM, vendor, and hardware lifecycle management
  • Scaling operations from ~1,100 GPUs to 20,000+ GPUs
  • RDMA networking (InfiniBand and/or RoCE)
  • Multi-tenant and single-tenant production environments
  • 99.9%+ availability targets
  • Repeatable, auditable commissioning processes
  • Enterprise readiness layered onto hyperscale infrastructure
  • Power, cooling, rack density, and facility readiness
  • Deployment sequencing and capacity expansion
  • Operating constraints driven by energy, thermal, and site realities
  • Field operations, deployment, and infra ops teams
  • Hiring, structure, and escalation paths as scale ramps
  • Clear ownership across sites and customers
  • Daily standups on deployment progress, risks, and blockers
  • Direct oversight of GPU failures, RDMA performance, and thermal issues
  • Production sign-off for each deployment tranche
  • Weekly capacity and readiness planning with facilities
  • Monthly OEM and vendor performance reviews
  • Quarterly planning for expansion, refresh cycles, and new platforms

Benefits

  • Hut 8 offers a benefits and wellness program that includes medical, dental, vision, life, and short-term and long-term disability insurance, as well as paid time off.
  • At Hut 8, you will have the opportunity to:
      ▶ Work with bright, driven peers from a range of educational and professional backgrounds including software development, energy, engineering, entrepreneurship, investment banking, private equity, and management consulting
      ▶ Design and pitch new products, services, and other initiatives to a leadership team consisting of serial entrepreneurs and seasoned executives and backed by a board of directors consisting of industry veterans of energy, finance, and government
      ▶ Debate ideas and alternatives in a truly meritocratic setting where the learning curve is steep and the lessons come from both senior and junior members of the team
      ▶ Build a lifelong network of friends and professional connections at the cutting-edge intersection of technology, energy, and infrastructure