Data Platform Engineer, Fauna

AmazonNew York, NY

About The Position

We are looking for a Data Platform Engineer to build the foundational data systems that power our robotics and machine learning development. In this role, you will design and implement the infrastructure for collecting, storing, processing, and transforming the vast amounts of data generated by our robots—from sensor telemetry and video streams to operational logs and performance metrics. You'll work closely with ML teams to ensure data is accessible, well-structured, and ready for training. Your work will enable research teams to iterate faster and operations teams to monitor fleet performance as we scale. Fauna Robotics, an Amazon company, is building capable, safe, and genuinely delightful robots for everyday life. Our goal is simple: make robots people actually want to live and interact with in everyday human spaces. We believe that future won’t arrive until building for robotics becomes far more accessible. Today, too much effort is spent reinventing the fundamentals. We’re changing that by developing tightly integrated hardware and software systems that make it faster, safer, and more intuitive to create real-world robotic products. Our work spans the full stack: mechanical design, control systems, dynamic modeling, and intelligent software. The focus is not just functionality, but experience. We’re building robots that feel responsive, expressive, and genuinely useful. At Fauna, you’ll work at the frontier of this space, helping define how robots move, manipulate, and interact with people in natural environments. It’s an opportunity to solve hard problems across hardware and software with a team focused on making robotics accessible and joyful to build. If you care about making robotics real for everyone and building systems that are as delightful as they are capable, we’re interested in hearing from you.

Requirements

  • Bachelor's degree or above in computer science, computer engineering, or related field, or experience in data science, machine learning or data mining
  • 3+ years of data engineering experience
  • Experience in scripting for automation (e.g. Python) and advanced SQL skills.
  • Experience in Kafka, or experience in Hive/Spark/Hbase/Yarn and experience in software development
  • Experience with cloud computing technologies
  • Knowledge of distributed systems as it pertains to data storage and computing
  • Proficiency with data storage technologies (e.g., PostgreSQL, object storage)

Nice To Haves

  • Experience working with robotics or IoT data (time-series, video, point clouds)
  • Knowledge of streaming architectures and real-time analytics
  • Familiarity with ML techniques and how data preparation impacts model training
  • Experience with data cataloging, metadata management, and data discovery tools

Responsibilities

  • Design and build scalable data pipelines for ingesting and processing robotics data (sensor streams, video, telemetry, logs)
  • Develop and maintain data storage solutions optimized for diverse data types and access patterns
  • Create tools and APIs for researchers and engineers to efficiently query and analyze large datasets
  • Build real-time data processing systems for monitoring robot fleet performance
  • Build and maintain data transformation pipelines that prepare robotics data for ML training
  • Collaborate with ML and robotics teams to ensure data platforms meet their evolving needs

Benefits

  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service