About The Position

Join us in building the machine learning platform that enables teams at Apple to build Apple Intelligence and many other intelligent experiences across hardware, software and service products. As a Machine Learning Data Platform Engineer, you'll design and build the scalable dataset management platform that enables teams across Apple to discover, curate, version, share, process, and consume ML datasets with enterprise-grade compliance and governance. We're looking for an engineer with deep expertise in big data infrastructure and a passion for building platforms that make ML practitioners more productive. You'll work at the intersection of large-scale data systems, ML workflows, and data governance.

Requirements

  • Bachelor's degree in Computer Science, related field, or equivalent practical experience.
  • 10+ years building and scaling data infrastructure for petabyte-scale ML workloads with high reliability.
  • Deep expertise in modern data technologies (Apache Iceberg, Spark, S3, distributed systems), data modeling, schema evolution, and efficient storage formats (Parquet, Arrow, ORC).
  • Experience building data pipelines that handle diverse ML data types: structured/tabular data, unstructured media (images, video, audio), embeddings, and multimodal datasets.
  • Proven track record building dataset management systems including versioning, metadata management, discovery, and integration with production ML training pipelines.
  • Experience designing data governance frameworks including lineage tracking, access control, retention policies, and compliance workflows.
  • Experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes).
  • Strong cross-functional collaboration skills to understand diverse stakeholder needs and articulate technical decisions across ML engineering, data science, legal, and product teams.

Nice To Haves

  • Hands-on experience curating or managing datasets for production ML models.
  • Experience with data cataloging systems, metadata platforms, MLOps tools, or ML training frameworks.
  • Knowledge of privacy-preserving technologies and data quality/validation frameworks.

Responsibilities

  • Architect and build Apple's next-generation ML dataset management platform.
  • Enable ML teams across the company to efficiently manage the full lifecycle of datasets.
  • Design scalable infrastructure that supports dataset operations at massive scale while maintaining strong governance guarantees.
  • Build data lineage tracking systems and implement automated compliance workflows.
  • Create intuitive APIs and SDKs for dataset access.
  • Ensure seamless integration with ML training and evaluation pipelines.
  • Collaborate with teams building customer-facing ML features across iOS, macOS, and other Apple platforms.

Benefits

  • Comprehensive medical and dental coverage.
  • Retirement benefits.
  • Discounted products and free services.
  • Reimbursement for certain educational expenses, including tuition.
  • Opportunity to participate in Apple's discretionary employee stock programs.
  • Eligibility for discretionary bonuses or commission payments.
  • Relocation assistance.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Industry

Computer and Electronic Product Manufacturing

Education Level

Bachelor's degree

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service