Nebius-posted 9 days ago
Full-time • Mid Level
Remote
1,001-5,000 employees

Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field. Nebius seeks a Key Customers Solutions Architect to support key and strategic Nebius GPU Cloud services customers. In this role, you will be a trusted technical advisor, helping clients design, deploy, and scale AI solutions while managing large-scale GPU workloads involving hundreds to thousands of GPUs. You will also collaborate with sales and product teams to drive growth and enhance customer satisfaction. You’re welcome to work remotely from the United States or Canada.

  • Serve as the primary technical point of contact, troubleshooting and resolving complex AI/ML.
  • Guide customers in optimizing GPU performance for ML training and inference workloads, ensuring seamless integration and scalability.
  • Partner with the sales team to identify new opportunities, promote the latest products, and deliver technical presentations.
  • Act as a bridge to product teams, providing customer feedback, relaying feature requests, and ensuring alignment with customer requirements.
  • Engage with internal and external stakeholders, negotiate solutions, and effectively drive alignment to address customer challenges.
  • Experience: 5 - 10 + years in roles like Cloud Solutions Architect, Technical Account Manager, or Customer Engineer, with hands-on experience in cloud services and AI/ML workloads.
  • Proficiency in Infrastructure as Code (IaC) tools like Terraform and Ansible.
  • Experience with Kubernetes and Python programming.
  • Solid understanding of GPU computing, including ML training, inference workloads, and GPU stacks (e.g., CUDA, OpenCL).
  • Customer-centric approach with a proven ability to build trust and foster long-term relationships.
  • Strong ability to explain technical concepts to technical and non-technical audiences.
  • Hands-on experience with HPC/ML orchestration frameworks (e.g., Slurm, Kubeflow).
  • Experience with deep learning frameworks (e.g., PyTorch, TensorFlow).
  • Familiarity with ML tools from NVIDIA, AWS, Azure, and Google Cloud providers.
  • Strong project management skills, with the ability to prioritize tasks and deliver on deadlines.
  • Proven experience mentoring technical teams and driving team growth.
  • Expertise in stakeholder negotiation to support problem resolution and ensure seamless collaboration.
  • Health Insurance: 100% company-paid medical, dental, and vision coverage for employees and families.
  • 401(k) Plan: Up to 4% company match with immediate vesting.
  • Parental Leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
  • Remote Work Reimbursement: Up to $85/month for mobile and internet.
  • Disability & Life Insurance: Company-paid short-term, long-term, and life insurance coverage.
  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service