Data Center Manager

TensorWave, New Kensington, PA

About The Position

TensorWave is a GPU cloud infrastructure provider delivering high-performance compute for the world’s most demanding AI and machine learning workloads. We operate data centers across the United States and are scaling rapidly to meet surging demand for GPU compute. We are building a Data Center Operations team at each data center, and we’re looking for a leader to run this critical function.

This role is the heartbeat of our physical infrastructure. You will be responsible for the 24/7 availability, security, and efficiency of our data center environment. This isn’t just about keeping the lights on; it’s about optimizing high-density compute environments, leading a high-performing technical team, and ensuring a seamless hardware lifecycle. Reporting to the Director, you will bridge the gap between high-level operational strategy and hands-on execution, ensuring that "five nines" (99.999%) uptime isn’t just a goal but a reality.

This is a unique opportunity to grow a function that directly impacts every customer’s experience. In this role, you aren’t just managing boxes; you are the guardian of the data that drives our business. We offer a fast-paced environment where your expertise in streamlining the RMA process and developing technical talent will have a direct impact on our bottom line.

Requirements

  • 5+ years in data center operations, with at least 2 years in a formal leadership or supervisory role
  • Demonstrated experience supporting 24/7 customer-facing operations, including shift scheduling and on-call management
  • Hands-on experience with monitoring and observability platforms (e.g., Grafana, Prometheus, or similar)
  • Proven ability to hire, train, and lead technical operations teams
  • Excellent written and verbal communication skills, particularly in high-pressure incident scenarios
  • Experience with ticketing and incident tracking systems (e.g., PagerDuty, ServiceNow, Jira, or equivalent)

Nice To Haves

  • Certified Data Center Professional (CDCP) or Certified Data Center Specialist (CDCS)
  • ITIL Foundation certification
  • Project Management Professional (PMP) or similar
  • OSHA 10 or higher certification

Responsibilities

  • People Management: Lead, mentor, and schedule a team of 8 Data Center Technicians, fostering a culture of technical excellence and accountability.
  • Performance Tracking: Conduct regular 1:1s, performance reviews, and skill-gap assessments to ensure the team stays ahead of evolving technologies (e.g., liquid cooling, AI-optimized racking).
  • Workflow Optimization: Oversee the Inventory/RMA Specialist to ensure hardware replacement cycles are lean and that "dead on arrival" (DOA) equipment is processed with minimal downtime.
  • Infrastructure Oversight: Manage the installation, cabling, and decommissioning of server, storage, and networking hardware.
  • Uptime Management: Act as the primary escalation point for data center incidents, coordinating emergency repairs and root cause analysis (RCA).
  • Capacity Planning: Monitor power, cooling, and space utilization. Partner with the Director to forecast future growth and avoid "stranded capacity."
  • Vendor Management: Supervise third-party contractors (HVAC, Electrical, Security) to ensure maintenance is performed without disrupting operations.
  • Asset Management: Ensure 100% accuracy in the DCIM (Data Center Infrastructure Management) database.
  • Security & Safety: Enforce strict physical security protocols and OSHA safety standards.
  • Audit Readiness: Lead the facility through SOC 2, ISO, or HIPAA compliance audits by maintaining accurate, up-to-date documentation and procedural logs.

Benefits

  • Competitive salary
  • Stock options
  • 100% paid Medical, Dental, and Vision insurance
  • Flexible PTO
  • Paid Holidays
  • 401(k)
  • Parental Leave
  • Flexible Spending Account
  • Short Term Disability Insurance
  • Life and Voluntary Supplemental Insurance
  • Mental Health Benefits through Spring Health