We’re building a gamified developer platform where tens of thousands of engineers create high‑fidelity datasets that push LLM frontiers. This role owns the technical lifecycle of data pipelines—from defining new data formats with partner labs to shipping the tooling, environments, docs, and QA that make those formats real at scale.