About The Position

The Seed Multimodal Interaction and World Model team is dedicated to developing models with human-level multimodal understanding and interaction capabilities. The team also aims to advance the exploration and development of multimodal assistant products.

Responsibilities

  • Design and implement reinforcement learning (RL) training systems for large-scale multimodal foundation models
  • Develop unified modeling frameworks that integrate video, audio, and language, with a focus on visual latent reasoning
  • Explore RL-based approaches to bridge understanding and generation for multimodal visual reasoning
  • Collaborate with researchers to evaluate models on tasks involving world modeling, reasoning, and instruction-conditioned generation