Staff Engineer, AI Model

Lenovo-posted 3 months ago

Full-time

Morrisville, NC

Computer and Electronic Product Manufacturing

Resume

Match Score

Upload and Match ResumeTrack Jobs with Teal

We are seeking a highly motivated and skilled Model Development Engineer to join our rapidly growing AI team in Morrisville, NC. You will play a critical role in the training of large language models (LLMs), large vision models (LVMs), and large multimodal models (LMMs), including fine-tuning and reinforcement learning. This is a challenging yet rewarding opportunity to contribute to cutting-edge research and development in generative AI. You'll be working with a collaborative team to push the boundaries of what's possible with AI models and deploy them into innovative products.

Design, implement, and evaluate training pipelines for large generative AI models, encompassing multiple stages of post-training.
Design, implement, and evaluate data augmentation pipelines to increase the diversity and robustness of training datasets, improving model performance, particularly in low-data regimes.
Develop and implement model evaluation pipeline for LLMs.
Developing and executing Supervised Fine-tuning (SFT) strategies for specific tasks.
Developing and leveraging RLHF algorithms in model training such as DPO and KTO.
Exploring RL training strategies, sampling, reward function design, etc. to apply large scale RL to model training.
Implement and evaluate model quantization techniques to reduce model size and accelerate inference speed, balancing precision loss with performance gains for deployment across diverse hardware platforms.
Utilizing techniques for efficient fine-tuning of large language models, balancing performance and resource constraints, and tailoring model performance for downstream tasks well.
Experiment with various training techniques, hyperparameters, and model architectures to optimize performance and efficiency.
Develop and maintain data pipelines for processing and preparing training data.
Stay up-to-date with the latest advancements in large language models, training techniques, and related technologies.
Collaborate with other engineers and researchers to design, implement, and deploy AI-powered products.
Contribute to the development of internal tools and infrastructure for model training and evaluation.

Ph.D degree in Computer Science, Machine Learning, or a related field.
Strong programming skills in Python and experience with deep learning frameworks like Pytorch, Transformers.
Solid understanding of machine learning principles, including supervised learning, unsupervised learning, and reinforcement learning.
Proven experience in designing and conducting experiments, analyzing data, and drawing meaningful conclusions.
Familiarity with large language models, transformer architectures, and related concepts.
Experience with data processing tools and techniques (e.g., Pandas, NumPy).
Experience with distributed training frameworks (e.g., DeepSpeed, Megatron-LM).
Excellent communication, collaboration, and problem-solving skills.

Track Jobs with Teal

Job Search Resources

•

AI Resume Builder

•

AI Engineer Resume Examples

•

AI Engineer Cover Letter Examples

Staff Engineer, AI Model

Job Search Resources

Tools

Career Hubs

Guides

Company