We are hiring a researcher with a strong technical background in Image/Video generation and editing, as well as Multimodal Foundation Models. You will play a critical role in the research and development of multimodal foundation models for image/video/3D generation, editing, animation, and many more. As a member of the team, you will have the opportunity to develop fundamental model capabilities, collaborate with team members with diverse backgrounds to work on ambitious projects, and collaborate broadly across Apple with world-class engineers and researchers to advance our products and delight millions of users. DESCRIPTION As a member of our fast-paced group, you’ll have the unique and rewarding opportunity to shape upcoming products from Apple. We are looking for people with excellent applied machine learning, computer vision/graphics experience, and solid engineering skills in creating outstanding model capabilities and product features. This role will have the following responsibilities: - Developing, fine-tuning, and evaluating foundational image generation and image editing models, as well as unified multimodal foundation models capable of both visual understanding and generation. - Developing, fine-tuning, and evaluating domain-specific image generation and editing models for various tasks and applications in Apple’s AI-powered products. - Conducting innovative research and transferring pioneering research in generative AI to production-ready technologies. - Understanding product requirements, translating them into modeling tasks and engineering tasks.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
Ph.D. or professional degree
Number of Employees
5,001-10,000 employees