About The Position

Adobe Firefly Applied Science & Machine Learning (ASML) is looking for a Manager, Applied Science – Generative AI Data Research to lead a team conducting multimodal data research and build datasets that support training and development of foundation models for generative AI multimodal generation ( image, video, audio) The role is for a senior manager who has experience with data for foundation model training, software engineering background, and applied ML . You will lead a team responsible for developing datasets for foundational model training. This involves content acquisition, filtering, curation and media understanding, and performing model ablations to validate data before large-scale training. The focus of this role is on large-scale data which supports foundation model training in all its stages. You will work closely with applied researchers, ML engineers, and product partners to turn evolving research and product needs into scalable, maintainable data pipelines and datasets . Success in this role requires strong engineering judgment, comfort with ambiguity, and the ability to guide both system design and applied ML workflows , while scaling impact through effective team leadership and execution.

Requirements

  • MS or PhD in Computer Science, Engineering, AI/ML, or a related technical field, or equivalent practical experience
  • At least 4+ years of experience leading or managing technical teams in AI/ML domains, with a large component of working on data.
  • Strong hands-on experience working on model training and data.
  • Ability to operate as a technical leader , making sound design tradeoffs and unblocking complex engineering problems
  • Strong communication skills and the ability to collaborate across research, engineering, and product organizations
  • Comfort working in fast-moving, ambiguous environments typical of generative AI development

Responsibilities

  • Lead and grow a team of applied scientists and engineers working on multimodal data research and generative AI training systems
  • Own and evolve key parts of the training and experimentation datasets.
  • Partner closely with applied research and engineering teams to support the full lifecycle from experimentation to production
  • Drive technical design and architecture decisions for large-scale data pipelines and dataset creation and maintenance.
  • Remain hands-on enough to review designs, guide implementation choices, and unblock complex technical issues
  • Establish standard processes for system robustness, testing, observability, and reproducibility
  • Drive execution against milestones while adapting to changing research and product priorities
  • Build a strong team culture focused on ownership, collaboration, and continuous improvement
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service