AI Infrastructure Engineer

ZoomSeattle, WA
6dHybrid

About The Position

We are seeking an experienced AI Infrastructure Engineer to join our AI Incubation team. You will build and optimize large-scale training infrastructure for Large Language Models (LLMs). The ideal candidate will combine engineering fundamentals with practical experience in AI infrastructure development, demonstrating both technical depth and the ability to deliver scalable solutions for complex AI systems. About the Team With eight specialized departments, the engineering team functions as a highly collaborative, diverse powerhouse. Each department mission is to deliver seamless and innovative communication solutions. These range from software development and machine learning to quality assurance teams that work to create and maintain Zoom's user-friendly interfaces and robust infrastructure. The team continues to push the boundaries of communication technology, bringing people together regardless of their physical distance.

Requirements

  • Possess a Bachelor's degree in Computer Science, Artificial Intelligence, Machine Learning, Cognitive Science, or a related field.
  • Have 5+ years of software engineering experience with a focus on infrastructure and systems
  • Exhibit expertise in GPU programming and CUDA optimization
  • Have experience with container technologies (Docker, Kubernetes), distributed systems and cloud computing
  • Have experience building large-scale distributed systems and optimizing neural network performance
  • Demonstrate programming skills in Python, C++, and CUDA, with deep learning frameworks (PyTorch, Transformers)
  • Have a deep understanding of neural network architectures and training methodologies

Responsibilities

  • Designing and developing scalable AI infrastructure solutions for training and deploying large language models
  • Building and optimizing distributed training platforms using cutting-edge technologies
  • Implementing and maintaining containerized AI environments using Docker and Kubernetes
  • Optimizing CUDA kernels for maximum GPU utilization and performance
  • Developing platform software to support AI/ML workflows
  • Collaborating with AI researchers to implement efficient training and inference pipelines

Benefits

  • As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service