Engineer, AI Model

LenovoMorrisville, NC
131d

About The Position

We are seeking a highly motivated and skilled Model Development Engineer to join our rapidly growing AI team in Morrisville, NC. You will play a critical role in the training of large language models (LLMs), large vision models (LVMs), and large multimodal models (LMMs), including fine-tuning and reinforcement learning. This is a challenging yet rewarding opportunity to contribute to cutting-edge research and development in generative AI. You'll be working with a collaborative team to push the boundaries of what's possible with AI models and deploy them into innovative products.

Requirements

  • Master's degree in Computer Science, Machine Learning, or a related field and 2+ years of relevant work experience or 4+ years of relevant work experience.
  • Strong programming skills in Python and experience with deep learning frameworks like PyTorch.
  • Solid understanding of machine learning principles, including supervised learning, unsupervised learning, and reinforcement learning.
  • Proven experience in designing and conducting experiments, analyzing data, and drawing meaningful conclusions.
  • Familiarity with large language models, transformer architectures, and related concepts.
  • Experience with data processing tools and techniques.
  • Excellent communication, collaboration, and problem-solving skills.

Responsibilities

  • Design, implement, and evaluate training pipelines for large generative AI models, encompassing multiple stages of post-training.
  • Design, implement, and evaluate data augmentation pipelines to increase the diversity and robustness of training datasets, improving model performance, particularly in low-data regimes.
  • Develop and implement model evaluation pipeline for LLMs.
  • Developing and executing Supervised Fine-tuning (SFT) strategies for specific tasks.
  • Developing and leveraging RLHF algorithms in model training such as DPO and KTO.
  • Exploring RL training strategies, sampling, reward function design, etc. to apply large scale RL to model training.
  • Implement and evaluate model quantization techniques to reduce model size and accelerate inference speed, balancing precision loss with performance gains for deployment across diverse hardware platforms.
  • Utilizing techniques for efficient fine-tuning of large language models, balancing performance and resource constraints, and tailoring model performance for downstream tasks well.
  • Experiment with various training techniques, hyperparameters, and model architectures to optimize performance and efficiency.
  • Develop and maintain data pipelines for processing and preparing training data.
  • Stay up-to-date with the latest advancements in large language models, training techniques, and related technologies.
  • Collaborate with other engineers and researchers to design, implement, and deploy AI-powered products.
  • Contribute to the development of internal tools and infrastructure for model training and evaluation.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Industry

Computer and Electronic Product Manufacturing

Education Level

Master's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service