Lenovo (posted 3 months ago)
Full-time
Morrisville, NC
Computer and Electronic Product Manufacturing

We are seeking a highly motivated and skilled Model Development Engineer to join our rapidly growing AI team in Morrisville, NC. You will play a critical role in the training of large language models (LLMs), large vision models (LVMs), and large multimodal models (LMMs), including fine-tuning and reinforcement learning. This is a challenging yet rewarding opportunity to contribute to cutting-edge research and development in generative AI. You'll be working with a collaborative team to push the boundaries of what's possible with AI models and deploy them into innovative products.

  • Design, implement, and evaluate training pipelines for large generative AI models, encompassing multiple stages of post-training.
  • Design, implement, and evaluate data augmentation pipelines to increase the diversity and robustness of training datasets, improving model performance, particularly in low-data regimes.
  • Develop and implement model evaluation pipelines for LLMs.
  • Develop and execute supervised fine-tuning (SFT) strategies for specific tasks.
  • Develop and apply RLHF-style preference optimization algorithms, such as DPO and KTO, in model training.
  • Explore RL training strategies, sampling methods, and reward function design to apply large-scale RL to model training.
  • Implement and evaluate model quantization techniques to reduce model size and speed up inference, balancing precision loss against performance gains for deployment across diverse hardware platforms.
  • Apply efficient fine-tuning techniques to large language models, balancing performance against resource constraints and adapting models to downstream tasks.
  • Experiment with various training techniques, hyperparameters, and model architectures to optimize performance and efficiency.
  • Develop and maintain data pipelines for processing and preparing training data.
  • Stay up to date with the latest advancements in large language models, training techniques, and related technologies.
  • Collaborate with other engineers and researchers to design, implement, and deploy AI-powered products.
  • Contribute to the development of internal tools and infrastructure for model training and evaluation.
  • Ph.D. in Computer Science, Machine Learning, or a related field.
  • Strong programming skills in Python and experience with deep learning frameworks such as PyTorch and Transformers.
  • Solid understanding of machine learning principles, including supervised learning, unsupervised learning, and reinforcement learning.
  • Proven experience in designing and conducting experiments, analyzing data, and drawing meaningful conclusions.
  • Familiarity with large language models, transformer architectures, and related concepts.
  • Experience with data processing tools and techniques (e.g., Pandas, NumPy).
  • Experience with distributed training frameworks (e.g., DeepSpeed, Megatron-LM).
  • Excellent communication, collaboration, and problem-solving skills.