About The Position

The Data Engineering team at Roblox plays a crucial role in enabling the company's success by developing and maintaining highly leveraged Core Data Sets, frameworks, and tooling to support the growing demand for analytics. As a Principal Data Engineer, you will work to define the data ontology for all of Roblox, establish best practices and standards for data operations and lifecycle management, design and build analytics tooling and frameworks, and influence event instrumentation. Additionally, this role is highly cross-functional, requiring close collaboration with Data Science, Experimentation, and Machine Learning teams to understand customer requirements and analytics applications, as well as with Data Infrastructure and Storage teams to develop integrated solutions. Join us and be a part of a dynamic team driving innovation and growth at Roblox.

Requirements

  • 8+ years of professional experience building scalable ETL pipelines on industry-standard ETL orchestration tools (Airflow, Dagster, Luigi, Google Cloud Composer, etc.) with deep expertise in SQL, PySpark, or Scala.
  • 3+ years leading data engineering development directly with business or data science stakeholders
  • Built, scaled, and maintained multi-terabyte data sets, with an expansive toolbox for debugging and unblocking large-scale analytics challenges (skew mitigation, sampling strategies, accumulation patterns, data sketches, etc.)
  • Experience with at least one major cloud's suite of offerings (AWS, GCP, Azure).
  • Developed or enhanced ETL orchestration tools or frameworks
  • Worked within a standard GitOps workflow (branch and merge, PRs, CI/CD systems)

Responsibilities

  • Partner with Data Science, Data Platform, Product, and Engineering to collect requirements to define the data ontology for all of Roblox
  • Lead and mentor a growing team of Data Engineers to support Roblox's ever-evolving data needs
  • Design, build, and maintain efficient and reliable batch and streaming data pipelines to model business entities as core data sets
  • Develop scalable frameworks and tooling to automate analytics workflows and streamline users' interactions with data products
  • Establish and evangelize best practices for data operations and lifecycle management