Technical Program Manager, Data Center Operations

FluidstackAustin, TX
$200,000 - $270,000

About The Position

Fluidstack is building civilization-scale infrastructure for AI, aiming to deliver 10 to 100s of GWs of compute faster than anyone else. This involves rethinking every layer of the stack, from acquiring power to designing, building, and operating data centers. The company operates with a focus on extreme ownership, velocity, first principles, and a passion for the problem space. The Data Center Operations Team is responsible for operating at the scale of a nation, managing sites that come online in pieces while keeping live ones running flawlessly, and establishing new operational standards due to the unprecedented speed and scale of operations.

Requirements

  • Program management experience in mission-critical environments where a delayed handover or missed SOP had real operational consequences.
  • Experience designing operational frameworks from scratch: handover gates, SOP libraries, incident management programs.
  • Ability to quarterback across design, construction, supply chain, and site ops teams simultaneously.
  • Clear written communication skills to distill complex operational issues for various stakeholders.
  • Experience tracking incident trends and CAPA status in live dashboards and following corrective actions through to closure.
  • Experience personally building or maintaining SOPs and measuring their actual adherence.

Nice To Haves

  • ITIL, PMP, or PgMP certification.
  • Hyperscale or large colo operator experience.
  • Familiarity with ASHRAE, Uptime Institute, or TIA-942 standards.
  • Exposure to datacenter construction and commissioning processes.

Responsibilities

  • Own the end-to-end site handover framework: define the gates, acceptance criteria, and sign-off procedures that move a new facility from construction to live operations without dropped terms or late surprises.
  • Embed into design, construction, and due diligence teams early enough to shape maintainability requirements before they become field problems.
  • Drive the cross-functional handover rhythm across training, documentation, systems access, and knowledge transfer, surfacing blockers weeks before they hit the go-live schedule.
  • Build and maintain the SOPs that govern critical datacenter operations across the fleet, with metrics that track adoption, execution quality, and efficiency at each site.
  • Lead incident management and stability improvement programs, including post-incident reviews with root cause analysis, corrective action tracking, and preventive maintenance oversight that reduces unplanned outages across the global footprint.
  • Produce the dashboards and reporting that give leadership visibility into stability metrics and incident trends, and run the CAPA programs that turn that data into durable fixes.

Benefits

  • Competitive total compensation package (salary + equity)
  • Retirement or pension plan, in line with local norms
  • Health, dental, and vision insurance
  • Generous PTO policy, in line with local norms
  • Equity in the form of stock options
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service