About The Position

Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field. Where we work Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team. The role Nebius operates large-scale, GPU-dense AI infrastructure across mission-critical data center environments. As a Senior Delivery Deployment Engineer, you will own the end-to-end delivery, deployment, and production readiness of next-generation GPU platforms inside our data centers. This role sits at the intersection of hardware, Linux systems, and operational execution. You will lead on-site rack bring-up, validate NVIDIA-based AI systems, coordinate repairs, and ensure GB-series infrastructure moves from installation to fully operational production environments with precision and reliability. You will collaborate closely with hardware engineering, networking, and infrastructure teams to deploy and stabilize H200 and B200-based GPU systems at scale.

Requirements

  • Strong hands-on experience deploying and operating data center infrastructure
  • Deep familiarity with GPU-dense systems, ideally NVIDIA H-series platforms
  • Experience working with high-density rack deployments (GB-series or similar)
  • Solid Linux experience, including troubleshooting and scripting
  • Ability to diagnose issues across hardware, OS, firmware, and network layers
  • Experience coordinating field repairs and working directly with hardware vendors
  • Proven experience leading technical teams or overseeing field operations
  • High ownership mindset and ability to operate in production-critical environments
  • Clear communication skills and ability to collaborate across distributed teams

Nice To Haves

  • Experience deploying AI or HPC clusters at scale
  • Familiarity with automated provisioning or infrastructure lifecycle systems
  • Background in hardware qualification, burn-in testing, or factory validation
  • Experience supporting rapid infrastructure expansion
  • Exposure to ARM-based or heterogeneous compute environments

Responsibilities

  • Lead end-to-end deployment of GB-series racks within data center environments
  • Oversee installation, bring-up, validation, and production readiness of NVIDIA H200 and B200-based servers
  • Troubleshoot complex hardware, firmware, Linux OS, and networking issues
  • Execute structured testing and validation procedures during deployment
  • Develop and maintain basic Linux-based hardware health-check and diagnostic scripts
  • Coordinate on-site hardware repairs, part replacements, and vendor escalations
  • Drive root cause analysis and ensure corrective actions are implemented
  • Manage and prioritize deployment timelines across multiple concurrent rollouts
  • Provide technical leadership and guidance to on-site engineers and technicians
  • Partner with networking and infrastructure teams to ensure seamless integration
  • Document deployment processes, validation standards, and operational runbooks

Benefits

  • Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families
  • 401(k) plan: up to 4% company match with immediate vesting
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers
  • Remote work reimbursement: up to $85/month for mobile and internet
  • Disability & life insurance: company-paid short-term, long-term, and life insurance coverage
  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service