Sr. AI Platform Architect

KLAAnn Arbor, MI
12dOnsite

About The Position

We are seeking a highly skilled and motivated AI Platform Architect to join our Corporate IT Cloud & DevOps team. In this role, you will be responsible for architecting, building, and managing our Enterprise GenAI Hybrid Platform, enabling scalable and secure AI/ML workloads across on-premises and cloud environments. You will play a key role in operationalizing LLMOps capabilities, integrating open-source tools, and supporting distributed training and inferencing at scale

Requirements

  • Bachelor's Degree or equivale nt expe rience in Computer Science or related IT field
  • Eight (8) years of implementing and maintaining AI/ML Infrastructure in an On-Prem environment
  • Strong experience with AI/ML infrastructure and tools, including GPU clusters and Kubernetes
  • Proficiency in deploying and managing open-source GenAI components and vector databases
  • Hands-on experience with high-performance computing (HPC) environments
  • Expertise in designing and managing on-premises, cloud, and hybrid-based ML platforms
  • Strong Linux system administration and scripting skills

Responsibilities

  • Design, deploy, and manage scalable AI/ML infrastructure supporting hybrid cloud and on-prem environments
  • Work extensively with open-source MLOps platforms (e.g., Kubeflow, MLflow, Flyte) to streamline model development, deployment, and lifecycle management
  • Architect and optimize GenAI infrastructure, including integration of vector databases and large language model serving frameworks
  • Implement and manage high-performance shared storage systems (e.g., Ceph, MinIO) for distributed AI workloads
  • Set up and maintain InfiniBand networking for low-latency, high-throughput GPU cluster communication
  • Collaborate with ML engineers, data scientists, and DevOps teams to build a cohesive and efficient AI/ML ecosystem
  • Monitor and enhance infrastructure performance, ensuring scalability, reliability, and security
  • Evaluate and integrate emerging GenAI tools and frameworks to continuously improve platform capabilities

Benefits

  • medical
  • dental
  • vision
  • life, and other voluntary benefits
  • 401(K) including company matching
  • employee stock purchase program (ESPP)
  • student debt assistance
  • tuition reimbursement program
  • development and career growth opportunities and programs
  • financial planning benefits
  • wellness benefits including an employee assistance program (EAP)
  • paid time off and paid company holidays
  • family care and bonding leave

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service