Are you excited about the amazing potential of foundation models, LLMs, and multimodal LLMs? We are looking for individuals who thrive on collaboration and have a desire to push the boundaries of what is possible today! The Video Computer Vision org is a centralized applied research and engineering organization responsible for developing real-time on-device Computer Vision and Machine Perception technologies across Apple products. We balance research and product to deliver Apple quality, pioneering experiences, innovating through the full stack, and partnering with HW, SW, and ML teams to influence the sensor and silicon roadmap that brings our vision to life.We are seeking a highly motivated and skilled senior Applied Research Engineer to join our team. The ideal candidate will have a strong background in developing and exploring capabilities of foundation models and multimodal large language models that integrate various types of data such as text, image, video, and audio. The ideal candidate should have familiarity with agentic AI, reasoning, and large-scale evaluations of agentic systems. In this role, you will work on ground breaking research projects to advance our AI and computer vision capabilities, contributing to both foundational research and practical applications.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Industry
Computer and Electronic Product Manufacturing
Education Level
Ph.D. or professional degree
Number of Employees
5,001-10,000 employees