Senior Research Engineer, Olmo

The Allen Institute for Artificial Intelligence•Seattle, WA

28d•$170,000 - $220,000•Onsite

About The Position

We are a non-profit AI institute, focused on developing foundational AI research and innovation to deliver real-world impact through large-scale open models, data, and artifacts (e.g., Olmo, Tulu, Molmo). We unite the best and brightest scientific and engineering minds to explore the potential of truly open AI. Through our efforts, including the pioneering Olmo releases, we endeavor to empower academics, researchers, and AI developers more broadly to advance language models and generative AI models. Through close collaboration, we rapidly identify, define, and act on the most exciting and promising new ideas in AI. Our team engages in a broad range of AI research, including pre-training and post-training language models, curating data to enhance AI across different modalities, and developing novel methodologies to push the field forward. We study and evaluate AI models both theoretically and empirically, aiming to advance their capabilities. Additionally, we create impactful real-world applications, such as in scientific synthesis. Our goal is to develop state-of-the-art models that excel in scientific discovery, reasoning, and factual recall. You will be a part of the core team of research and machine learning engineers working on the infrastructure, architecture, modeling and training of Olmo (Open Language Model) at all stages: pre-training, mid-training, post-training and all emerging paradigms. In this role you will be owning the design and implementation of the systems that train these models. You will be responsible for building scalable machine learning pipelines as we push the boundaries of large language modeling research. You will be collaborating with colleagues inside and outside your own team, but you are responsible for a feature or experiment from start to finish, from conception to implementation.

Requirements

Expertise at building ML infrastructure - having 4+ years of industry experience building infrastructure that handles data preprocessing/transformation and model training, evaluation, inference, and deployment
Deep experience in the complete model development cycle, including data set construction, training, tuning, evaluation, performance profiling, and monitoring
Knowledge of modern deep learning and natural language processing techniques
Strong software engineering skills, particularly around building performant systems and debugging
At-home with hands-on programming – must have experience with Python and PyTorch/Jax/Tensorflow. We expect you to be the kind of engineer who can pick up a new programming language, library, or API as needed without it being a big deal.
Familiarity working with cloud compute resources (e.g. AWS) and containerization (e.g. Docker)
Strong collaboration and communication skills - our environment is small and collaborative, and we'd like you to thrive while working closely with others, sometimes with complementary skills/perspectives

Nice To Haves

Advanced degree in Data Science/CS/EE/Applied Mathematics/Statistics/ML/NLP or related fields and/or relevant and equivalent engineering experience
Contributions to open-source ML or research libraries (e.g. spaCy, AllenNLP, transformers)
Experience successfully operating models at scale in a production setting
Experience in HPC settings
Curiosity about AI research

Responsibilities

Building infrastructure to facilitate the next generation of LLM research
Optimizing training and inference for language models
Triaging between experiments and executing on the most impactful
Supporting and collaborating with an open-source community
Bridging the gap between cutting-edge research and a widely adopted product
Bringing software engineering best practices to a research environment
Releasing your contributions back to the broader community in the form of open source software, model releases, and additions to Ai2’s public API and open research datasets, as well as technical reports

Benefits

Team members and their families are covered by medical, dental, vision, basic life insurance, basic accidental death and dismemberment insurance, short-term disability, long-term disability, and an employee assistance program.
Team members are able to enroll in our voluntary life insurance program, our voluntary accidental death and dismemberment program, our health savings account plan, our healthcare reimbursement arrangement plan, and our health care and dependent care flexible spending account plans.
Team members are able to enroll in our company’s 401k plan.
Team members will receive $125 per month to assist with commuting or internet expenses and will also receive $200 per month for fitness and wellbeing expenses.
Team members will also receive up to ten sick days per year, up to seven personal days per year, up to 20 vacation days per year and twelve paid holidays throughout the calendar year.
Team members will be able to receive annual bonuses and can participate in the long-term incentive plan.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume