Product Manager

Arena Intelligence, Inc.
Bay Area, CA

About The Position

Arena Intelligence is seeking a Product Manager to lead its evaluations platform. The role sits at the intersection of ML research, engineering, design, and product execution. The core challenge is not traditional roadmap management but defining how rapidly advancing AI research becomes trusted product infrastructure: translating emerging evaluation methodologies into systems and experiences that scale to millions of users and influence how the broader AI ecosystem interprets model performance. This is a high-ownership role requiring strong systems thinking, technical depth, product judgment, and the ability to navigate ambiguity in a rapidly evolving field.

Requirements

  • 5–8 years of product management experience in highly technical or ambiguous environments.
  • Strong familiarity with modern AI systems, including LLMs, multimodal models, agents, reasoning systems, and evaluation methodologies.
  • A track record of shipping technically complex products from concept to production.
  • Experience translating research-heavy or technically ambiguous work into clear product direction and execution.
  • Strong systems thinking — you can identify bottlenecks, coordination gaps, and scaling constraints across technical and organizational systems.
  • Exceptional cross-functional leadership skills. You can align researchers, engineers, and designers without relying on formal authority.
  • High agency and strong product judgment. You move quickly, make decisions with incomplete information, and create structure where little exists.
  • Strong written communication. You can write specifications for researchers and product narratives for external technical audiences with equal clarity.

Nice To Haves

  • Technical background in computer science, machine learning, or related fields.
  • Prior experience in evaluations, benchmarking systems, AI infrastructure, research tooling, or developer platforms.
  • Experience building products for technical audiences such as researchers, ML engineers, or developers.
  • Founder or early-stage startup experience.

Responsibilities

  • Own the roadmap and product strategy for Arena's evaluations and leaderboard platform.
  • Partner closely with ML researchers to translate emerging evaluation methodologies — multimodal evals, agentic workflows, reasoning traces, and new benchmark categories — into production-quality product experiences.
  • Define how evaluation research moves from prototype → implementation → launch → ecosystem adoption.
  • Drive cross-functional execution across research, engineering, design, and marketing to close the gap between research artifacts and trusted user-facing infrastructure.
  • Prioritize what gets evaluated next based on frontier model trends, developer demand, ecosystem gaps, and strategic opportunities.
  • Build systems, workflows, and operational rigor around evaluation quality, release cadence, and leaderboard credibility.
  • Own product metrics across adoption, engagement, citations, frontier-lab participation, and evaluation throughput.
  • Engage directly with frontier labs, researchers, developers, and enterprise users to identify where current evaluation systems break down and where the ecosystem is headed next.
  • Help shape how Arena balances evaluation rigor, usability, neutrality, and speed as the platform scales.

Benefits

  • Competitive compensation and equity aligned to the markets where our team members are based.
  • Comprehensive health and wellness benefits, including medical, dental, vision, and additional support programs.
  • The opportunity to work on cutting-edge AI with a small, mission-driven team.
  • A culture that values transparency, trust, and community impact.