The VCV org is a centralized applied research and engineering organization responsible for developing real-time on-device Computer Vision and Machine Perception technologies across Apple products. In the Human Intelligence team, we balance research and product to deliver Apple-quality, pioneering experiences, innovating through the full stack and partnering with HW, SW, and ML teams to influence the sensor and silicon roadmap that brings our vision to life. Join us in this truly exciting era of Artificial Intelligence to help deliver the next groundbreaking Apple products and experiences!

We are continuously advancing the state of the art in Computer Vision and Machine Learning, touching all aspects of multimodal LLMs, from data collection and curation to modeling, evaluation, and deployment. As a member of our dynamic group, you will have the unique and rewarding opportunity to craft upcoming research directions in the field of multimodal LLMs that will inspire future Apple products.

We are seeking highly motivated and skilled engineers to join our Human Intelligence team. The ideal candidates will have strong backgrounds in developing and exploring the capabilities of foundation models and agentic AI systems that enable natural, proactive, and personalized human interactions. You will be responsible for multimodal LLM development, including training, fine-tuning, agentic AI, and reasoning systems. In this role, you will work on cutting-edge research and engineering problems, collaborate across teams, and help shape the technical direction of multimodal and agentic AI systems from research to production.
Job Type
Full-time
Career Level
Senior