About The Position

Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field. Where we work Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team. The Role We are looking for a IT Support Manager in our data center, the key person in managing IT infrastructure maintenance. You will manage L1, L2, IT infrastructure and Field Network Engineers supporting data center IT infrastructure and GPU clusters. This role is focused on ensuring stable, efficient, and scalable operations through people leadership, structured support processes, and hands-on technical oversight. You will own day-to-day IT support operations, oversee maintenance and incident resolution for GPU clusters, and ensure high service levels across hardware, network, and field support activities. You will act as a key escalation point, drive operational improvements, and align support execution with broader data center objectives.

Requirements

  • 3+ years of experience managing technical support teams in a data center or similar critical infrastructure environment.
  • Strong hands-on background in diagnosing and resolving server hardware issues, including GPU-based systems, fiber networks.
  • Solid understanding of data center operations, server platforms, and enterprise networking principles.
  • Practical experience with IT service management processes (ITIL / ITSM).
  • Basic proficiency with Linux/Unix operating systems and command-line tools.
  • Proven ability to manage operational KPIs, incidents, and escalations.
  • Analytical skills
  • Proactive mindset, strong sense of ownership, and the ability to lead teams in high-pressure environments.
  • High proficiency in spoken and written English.

Nice To Haves

  • Relevant certifications and trainings

Responsibilities

  • Lead and manage L1, L2, IT infrastructure and Field Network Engineers supporting data center IT infrastructure and GPU clusters and region IX colocations.
  • Ensure high availability and reliability of GPU clusters and IT infrastructure
  • Act as the primary escalation point for complex hardware, network, and operational incidents
  • Monitor and drive SLA, KPI, and incident resolution performance across IT support services
  • Oversee proactive maintenance, troubleshooting, and problem management activities
  • Collaborate with infrastructure engineers, vendors, and colocation partners to resolve issues and deliver improvements
  • Maintain and improve support processes, documentation, and team training materials
  • Manage IT asset lifecycle activities, relates to responsibility area, including RMAs

Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service