Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority. About Us Our team is part of Google DeepMind (GDM) in the Frontier-AI unit. We specialize in multimodal foundational models, with a focus on image and video domains. We are looking for a research scientist to develop agentic solutions to improve the capabilities of multimodal models in GDM. Candidates must have strong machine learning skills, including experience in LLMs and computer vision. We also require competency in software engineering, which is required to implement robust solutions at the scale that we operate. Our team values both internal and external impact and there should be opportunities for both in this role. The Role As a Research Scientist specializing in Multimodal Agents, you will be at the forefront of developing innovative agentic solutions to enhance the capabilities of Google DeepMind's foundational models, particularly within the image and video domains. This is an exciting opportunity to contribute directly to advancing the state of the art in artificial intelligence, working with cutting-edge technologies and a team of world-class experts. You will be instrumental in designing, implementing, and deploying robust machine learning solutions at scale, with a clear path to both internal and external impact through product integration and publications. This role offers a unique chance to shape the future of AI agents by pushing the boundaries of multimodal understanding and interaction.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Education Level
Ph.D. or professional degree