Elicit is an AI research platform that uses language models to help researchers figure out what's true and make better decisions, starting with common research tasks like literature review.
What we're aiming for:
Elicit radically increases the amount of good reasoning in the world. For experts, Elicit pushes the frontier forward. For non-experts, Elicit makes good reasoning more affordable. People who don't have the tools, expertise, time, or mental energy to make well-reasoned decisions on their own can do so with Elicit.
Elicit is a scalable ML system based on human-understandable task decompositions, with supervision of process, not outcomes. This expands our collective understanding of safe AGI architectures.
Visit our Twitter to learn more about how Elicit is helping researchers and making progress on our mission.

The mission of Elicit evals
Some orgs build evals to warn us about dangerous capabilities. Others build evals to understand trends and predict future developments. Yet others build evals to hill-climb towards models that users will like more. At Elicit, we're focused on something different: we want to understand, and hill-climb towards, models that help us make better decisions. This is tougher than "what will users like better": it's hard to evaluate decision support, and users' knee-jerk reactions may not align with what actually helps for decision-making. Because it's hard, and because the sales pitch is more complicated, there aren't many doing this well. If we nail this, we have a unique opportunity to push AI toward helping us make better decisions, both within Elicit and beyond.

Why we're hiring for this role
We need someone to own the technical foundation of our auto-evaluation systems. Our evals are currently much slower than they need to be, and our interfaces aren't optimized for the diverse set of people who need to use them: ML engineers iterating on models, product managers monitoring quality, and customers assessing trust in results.
The right person for this role won't just build infrastructure. You'll think deeply about what it actually means for Elicit to help with decision-making in pharma and encode that understanding into our evaluation systems.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
11-50 employees