Role Summary: Own the end-to-end lifecycle of memory features—from research to production. You’ll fine-tune models for extraction, updates, consolidation/forgetting, and conflict resolution; turn customer pain points into research hypotheses; implement and benchmark ideas from papers; and ship with Engineering to SOTA latency, reliability, and cost . You’ll also build evaluation at scale (offline metrics + online A/Bs) and close the loop with real-world feedback to continuously improve quality. What You'll Do: Fine-tune and train models for memory extraction, updates, consolidation/forgetting, and conflict resolution; iterate based on data and outcomes. Read, reproduce, and implement research : quickly prototype paper ideas, benchmark against baselines, and productionize what wins. Build evaluation at scale : automated relevance/accuracy/consistency metrics, gold sets, online A/B & interleaving, and clear dashboards. Work closely with customers to uncover pain points, turn them into research hypotheses, and validate solutions through field trials. Partner with Engineering to ship : design APIs and data contracts, plan safe rollouts, and maintain SOTA latency, reliability, and cost at scale.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed