Member of Technical Staff, Agents Modeling

Cohere•New York, NY

129d

About The Position

We’re looking for an experienced machine learning researcher / engineer who can help us push the frontiers of agentic LLM systems. As a part of the team, you will help drive exploration and development of agentic techniques and have the opportunity to build the models that power our agentic solutions. Agentic LLM systems are being deployed widely across enterprise companies including through Cohere’s North platform. In this role, you’ll be working with a team developing new strategies for training models for advanced agent capabilities including reasoning, tool use, and memory. This includes developing data-generation techniques for post-training (SFT and RL*) Cohere’s models. Model advancements have direct impacts on North and other Cohere products creating an exciting opportunity where core model development leads to direct product advancements.

Requirements

Have a PhD in computer science or related field or similar industry research experience
Strong software engineering skills
Proficiency in Python and experience with ML-related code (e.g., pytorch, numpy, etc.)
Experience with LLMs and agentic frameworks
Experience with post-training LLMs (SFT, PEFT, or RL*)
Experience with building synthetic data generation pipelines

Responsibilities

Design and develop novel agentic solutions
Improve upon SOTA on hard agentic tasks
Research the next-generation of on-line learning-from-experience self-improvement
Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
Work with an amazing team of researchers and engineers pushing the boundaries

Benefits

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco and London and co-working stipend
6 weeks of vacation

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

Ph.D. or professional degree

Number of Employees

251-500 employees

Member of Technical Staff, Agents Modeling

About The Position

Requirements

Responsibilities

Benefits

What This Job Offers

Job Search Resources

Tools

Career Hubs

Guides

Company