About The Position

Join us in building the machine learning platform that enables teams at Apple to build Apple Intelligence and many other intelligent experiences across hardware, software and service products. As a Machine Learning Data Platform Engineer, you'll design and build the scalable dataset management platform that enables teams across Apple to discover, curate, version, share, process, and consume ML datasets with enterprise-grade compliance and governance. We're looking for an engineer with deep expertise in big data infrastructure and a passion for building platforms that make ML practitioners more productive. You'll work at the intersection of large-scale data systems, ML workflows, and data governance. In this role, you'll be architecting and building Apple's next-generation ML dataset management platform. This platform enables ML teams across the company to efficiently manage the full lifecycle of datasets, from initial curation and annotation through versioning, model training and evaluation, sharing, and compliance. You'll design scalable infrastructure that supports dataset operations at massive scale while maintaining strong governance guarantees. Your work will include building data lineage tracking systems, implementing automated compliance workflows, creating intuitive APIs and SDKs for dataset access, and ensuring seamless integration with ML training and evaluation pipelines. You'll collaborate with teams building customer-facing ML features across iOS, macOS, and other Apple platforms, as well as compute infrastructure teams and ML framework owners. Your platform work directly enables the ML innovations that millions of customers experience daily. This role offers the opportunity to have broad impact across Apple's ML initiatives and to shape how thousands of ML practitioners build the intelligent experiences our customers love.

Requirements

  • Hands-on experience curating or managing datasets for production ML models.
  • Experience with data cataloging systems, metadata platforms, MLOps tools, or ML training frameworks.
  • Knowledge of privacy-preserving technologies and data quality/validation frameworks.

Responsibilities

  • Design and build the scalable dataset management platform for ML datasets.
  • Enable teams to discover, curate, version, share, process, and consume ML datasets.
  • Architect and build Apple's next-generation ML dataset management platform.
  • Manage the full lifecycle of datasets from curation to compliance.
  • Design scalable infrastructure for dataset operations at massive scale.
  • Build data lineage tracking systems and implement automated compliance workflows.
  • Create intuitive APIs and SDKs for dataset access.
  • Ensure seamless integration with ML training and evaluation pipelines.
  • Collaborate with teams building customer-facing ML features across various Apple platforms.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Career Level

Mid Level

Industry

Computer and Electronic Product Manufacturing

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service