Intern, Deep Learning for Image and Video Processing

InterDigital, Inc.Los Altos, CA
75d$45 - $55

About The Position

The InterDigital AI Lab is seeking motivated Ph.D. and Master students for internships in Deep Learning for Image and Video Processing. The Lab’s research focuses on novel applications of AI and machine learning for video and wireless applications. Unlike traditional roles that involve tinkering with obscure parts of ads or social networks, you’ll have the opportunity to develop proof-of-concepts, prototypes, and research ideas that integrate machine learning and AI technologies with a wide range of video and wireless technologies. Your work will have the chance to be published at leading conferences. Our recent work focuses on distributed, real-time computer vision for edge-cloud systems. We reduce bandwidth usage by compressing intermediate feature maps using a practical mix of learned methods and standard video codecs (e.g., H.264). We mitigate network latency with Dedelayed, a delay-aware split inference technique recently submitted to ICLR. We also created the leading open-source library for deep learning-based image and video compression (github.com/InterDigitalInc/CompressAI), and an open-source platform used by MPEG to benchmark video coding for distributed vision models (github.com/InterDigitalInc/CompressAI-Vision).

Requirements

  • Ph.D. student in Computer Science/Engineering preferred, or relevant disciplines such as Electrical Engineering, Mathematics, Statistics, or Physics.
  • Strong background in at least one of computer vision, image processing, or video compression.
  • Proficient in PyTorch (preferred), TensorFlow, or JAX.

Nice To Haves

  • Experience with visually realistic Generative Adversarial Networks (GANs).
  • Unique or novel ideas in the field of AI and machine learning.

Responsibilities

  • Develop proof-of-concepts, prototypes, and research ideas that integrate machine learning and AI technologies with video and wireless technologies.
  • Conduct research in deep learning-based image and video compression and restoration.
  • Work on compression for multiple computer vision tasks and intermediate feature maps for split inference.
  • Explore real-time distributed video inference resilient to wireless latency, jitter, and bandwidth limits.
  • Engage in vision tasks such as semantic segmentation, depth estimation, object detection, and super-resolution.
  • Contribute to applications in autonomous driving, delivery drones, and robotics.

Benefits

  • Promote long-lasting collaborations with interns.
  • Opportunity to publish in leading conferences including CVPR, ICLR, ICCV, ICASSP, ICIP, DCC, PCS.
  • Collaborate with leading researchers in their fields.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Intern

Education Level

Ph.D. or professional degree

Number of Employees

251-500 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service