We are a non-profit AI institute, focused on developing foundational AI research and innovation to deliver real-world impact through large-scale open models, data, and artifacts (e.g., Olmo, Tulu, Molmo). We unite the best and brightest scientific and engineering minds to explore the potential of truly open AI. Through our efforts, including the pioneering Olmo releases, we endeavor to empower academics, researchers, and AI developers more broadly to advance language models and generative AI models. Through close collaboration, we rapidly identify, define, and act on the most exciting and promising new ideas in AI. Our team engages in a broad range of AI research, including pre-training and post-training language models, curating data to enhance AI across different modalities, and developing novel methodologies to push the field forward. We study and evaluate AI models both theoretically and empirically, aiming to advance their capabilities. Additionally, we create impactful real-world applications, such as in scientific synthesis. Our goal is to develop state-of-the-art models that excel in scientific discovery, reasoning, and factual recall. You will be a part of the core team of research and machine learning engineers working on the infrastructure, architecture, modeling and training of Olmo (Open Language Model) at all stages: pre-training, mid-training, post-training and all emerging paradigms. In this role you will be owning the design and implementation of the systems that train these models. You will be responsible for building scalable machine learning pipelines as we push the boundaries of large language modeling research. You will be collaborating with colleagues inside and outside your own team, but you are responsible for a feature or experiment from start to finish, from conception to implementation.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
101-250 employees