Artificial Analysis is the leading independent AI benchmarking company. We support labs, engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier. Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist. We are a team of 35+, on track to triple by year end, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, DeepLearning.ai, Amazon), Adam D'Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders. Artificial Analysis maintains one of the most comprehensive language model benchmarking suites in the industry, evaluating frontier models across quality, speed, and pricing for the AI labs and enterprises that rely on our data. We're hiring a Solutions Engineer to own the day-to-day operation of our language model benchmarking stack. This is a hands-on, operational role: you'll add new models to our evaluation pipeline, run and debug benchmarks, and serve as the primary technical point of contact for AI lab customers — explaining results, fielding methodology questions, and resolving API endpoint issues over Slack and video calls. This is not a software engineering role focused on building new systems. It's about running a sophisticated existing stack exceptionally well, consistently and reliably, while being the trusted technical face of Artificial Analysis to our customers.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed