As part of the AWS Applied AI Solutions organization, we have a vision to provide business applications, leveraging Amazon's unique experience and expertise, that are used by millions of companies worldwide to manage day-to-day operations. We accomplish this by accelerating our customers' businesses through delivery of intuitive and differentiated technology solutions that solve enduring business challenges. We blend vision with curiosity and Amazon's real-world experience to build opinionated, turnkey solutions.

We are looking for a Senior Product Manager, Technical to define and drive the product vision for AI Agent Evaluations within our Core Services AI Foundations team. You will own the end-to-end evaluation framework that enables application development teams to measure, benchmark, and continuously improve the quality, safety, and reliability of their AI-powered agents. This includes defining the product strategy for evaluation tooling, quality scoring methodologies, regression testing frameworks, and human-in-the-loop review workflows that give builders confidence their agents perform as intended before and after deployment.

You will work backward from the needs of development teams in Applied AI Solutions and AWS building agentic AI applications and define how they assess agent behavior across dimensions including correctness, safety, groundedness, and customer satisfaction. You will partner closely with engineering and applied science teams to translate complex evaluation methodologies into intuitive, self-service products that scale across diverse use cases and agent architectures.

The Core Services AI Foundations team within AWS Applied AI Solutions builds the foundational platform that enables application development teams to ship production-grade AI agents and applications with confidence. We provide the shared infrastructure, tooling, and guardrails that handle the hardest cross-cutting concerns in agentic AI: evaluation, identity and access management, observability and analytics, data and knowledge management, foundational agents, and user experience.
Job Type
Full-time
Career Level
Senior