At BlueCross BlueShield of Tennessee, we’re building a member-facing virtual AI assistant that helps people get answers faster, navigate benefits with confidence, and reduce friction across the healthcare journey. To ensure this experience is safe, trustworthy, and consistently high-quality, we’re hiring an AI Evaluation Lead to own the evaluation strategy and the datasets that prove our assistant is working—before it reaches members and as it evolves over time. In this role, you’ll partner with digital product, technical, operations, compliance, and customer service teams to define what “good” looks like, build gold-standard datasets that reflect real member needs, and drive an enterprise-grade evaluation framework that improves performance, reduces risk, and accelerates responsible delivery. To do that, you’ll need: Hands-on experience with quality methodologies Strong business knowledge and data curation experience Proficiency with Python and analytics tooling; familiarity with conversational AI evaluation patterns If you’re inspired by our mission – peace of mind through better health – and ready for a role where your technical leadership directly influences AI strategy and systems at scale, we’d love to hear from you. Note: This is a fully remote role, but onsite interviews at our Chattanooga, TN headquarters may be required. Sponsorship is not available for this role.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level