Thinking Machines Lab is seeking Pre-Training Researchers to join their team. This role is central to the company's roadmap, blending research with large-scale data engineering to build the datasets and data systems for the next generation of AI models. The ideal candidate will design and implement methods for sourcing, curating, and analyzing pre-training data for quality and performance, working with both automated pipelines and human-in-the-loop processes. This position requires strong coding skills and the ability to contribute scientific insight. It is suited for individuals who enjoy the intersection of data, machine learning, and systems, and are excited by the challenge of shaping frontier AI. The role emphasizes both fundamental research and practical engineering, requiring the ability to write high-performance code and analyze technical reports. This is an evergreen role, meaning applications are continuously reviewed for current and future opportunities.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior