About The Position

Sony AI America is seeking research interns to join their team focused on fundamental and applied research in building next-generation foundation models for vision in a responsible manner. The intern will develop efficient and effective methodologies and prototype solutions, working with scientists and engineers on challenging problems in foundation models and generative AI. This includes low-cost vision foundation models (VFM), vision-language models (VLM), unified models, automatic model compression, optimization, and deployment on cloud and edge. The work has the potential to be published in papers and improve the experience of billions of customers.

Requirements

  • Currently has, or is in the process of obtaining, a master/PhD degree in computer science or related field.
  • Be very self-motivated and capable of proposing and implementing innovative ideas.
  • Solid presentation and communication skills to internal and external audiences.
  • Publications or expertise in compact foundation model development and deployment.
  • Influential open-source projects or paper publication at top conferences, e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, ACL, etc.
  • Solid coding skills in Python, Pytorch, etc.

Nice To Haves

  • Better to have front-end development experience.

Responsibilities

  • Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLM), unified models, automatic model compression, optimization and deployment on cloud and edge.
  • Design or implement state-of-the-art techs on model compression, inference speedup, deployment on hardware, tool automation.
  • PoC for various vision+text, generation relevant tasks (VQA, captioning, understanding, etc.) and hardware.
  • Contribute to library and tool development to support business.
  • Publish influential research in top-tier conferences and journals.

Benefits

  • Eligible for overtime
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service