Senior Cluster Engineer – AI Inference Infrastructure

Qualcomm
San Diego, CA
Posted 82 days ago
$111,300 - $166,900

About The Position

As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. We are seeking a Senior Cluster Engineer to design, deploy, and manage our AI inference cluster ecosystem. In this role, you will deliver and deploy clusters that provide high availability, scalability, and performance for our customers.

Requirements

  • 2+ years of experience in infrastructure engineering or HPC environments.
  • Deep expertise in Linux system administration and cluster orchestration (Kubernetes and Slurm).
  • Strong knowledge of datacenter networking fundamentals and RoCE/RDMA.
  • Proficiency in Python and shell scripting for automation.
  • Hands-on experience with Ansible or similar automation tools.
  • Strong software engineering background (design patterns, CI/CD, testing).
  • Exposure to cloud platforms (AWS, Azure, GCP) and hybrid deployments.
  • Familiarity with AI inference frameworks and GPU-based workloads.

Responsibilities

  • Design and manage large-scale AI inference clusters.
  • Oversee server provisioning, networking, and OS lifecycle management in datacenters.
  • Implement automation frameworks for cluster deployment and maintenance.
  • Integrate out-of-band management using Redfish APIs.
  • Manage and optimize Kubernetes and Slurm clusters for AI workloads.
  • Ensure high-performance networking with RoCE/RDMA.
  • Build telemetry and observability systems using Prometheus and OpenTelemetry.

Benefits

  • Competitive annual discretionary bonus program.
  • Opportunity for annual RSU grants.
  • Comprehensive benefits package designed to support success at work, at home, and at play.

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: Bachelor's degree
  • Number of Employees: 5,001-10,000 employees