About The Position

Red Hat® OpenShift® AI is a flexible, scalable artificial intelligence (AI) and machine learning (ML) platform that enables enterprises to create and deliver AI-enabled applications at scale across hybrid cloud environments. Built using open-source technologies, OpenShift AI provides trusted, operationally consistent capabilities for teams to experiment, serve models, and deliver innovative apps. The OpenShift AI team seeks a Software Engineer with Kubernetes and Model Inference Runtimes experience to join our rapidly growing engineering team. Our team focuses on making machine learning model deployment and monitoring seamless and scalable across the hybrid cloud and the edge. This is a fascinating opportunity to build and impact the next generation of hybrid cloud MLOps platforms.

Requirements

  • Highly experienced with programming in Python and PyTorch
  • Familiarity with model parallelization, quantization, and memory optimization using vLLM, TGI, and other inference libraries.
  • Experience with Python packaging, such as PyPI libraries
  • Solid understanding of the fundamentals of model inference architectures
  • Experience with Jenkins, Git, shell scripting, and related technologies
  • Experience with the development of containerized applications in Kubernetes
  • Experience with Agile development methodologies
  • Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud
  • Ability to work across a large, distributed, hybrid engineering team

Nice To Haves

  • Experience with open-source development is a plus
  • Development experience with C++, especially with the CUDA APIs, is a big plus

Responsibilities

  • Develop and maintain a high-quality, high-performing ML inference runtime platform for multi-modal and distributed model serving.
  • Contribute directly to upstream inference runtime communities such as vLLM , TGI , PyTorch , OpenVINO , and others.
  • Maintain CI/CD build pipelines for container images that allow faster, more secure, reliable, and frequent releases
  • Coordination and communication with various stakeholders
  • Applying a growth mindset by staying up to date with AI and ML advancements

Benefits

  • Comprehensive medical, dental, and vision coverage
  • Flexible Spending Account - healthcare and dependent care
  • Health Savings Account - high deductible medical plan
  • Retirement 401(k) with employer match
  • Paid time off and holidays
  • Paid parental leave plans for all new parents
  • Leave benefits including disability, paid family medical leave, and paid military leave
  • Additional benefits including employee stock purchase plan, family planning reimbursement, tuition reimbursement, transportation expense account, employee assistance program, and more!

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service