We are seeking our first AI Engineer specializing in agents and evaluation. This foundational role will shape how we build, measure, and scale intelligent systems. You will design the playbook for high-performance AI agents and tackle the complex challenge of helping developers understand, evolve, and operate sophisticated systems using autonomous and event-driven AI. You will develop the evaluation frameworks, task harnesses, and orchestration strategies needed to make our agents reliable, testable, and truly valuable. Your contributions will directly improve our agents and produce reusable benchmarks and artifacts that can foster innovation and advance the broader foundation model ecosystem. This role is ideal for people who excel at designing experiments, building systems, and integrating theory with code in a research-engineering capacity, particularly in a 0-to-1 environment.
Job Type
Full-time
Career Level
Senior