As an Applied Machine Learning Engineer - Evaluations at Hippocratic AI, you'll be at the core of how we measure, understand, and improve our voice-based generative AI healthcare agents. Your work will translate complex, qualitative notions of empathy, safety, and accuracy into quantitative evaluation signals that guide model iteration and deployment. You'll design and implement evaluation harnesses, analysis tools, and visualization systems for multimodal agents that use language, reasoning, and speech. Partnering closely with research, product, and clinical teams, you'll ensure every model update is grounded in data, validated against real-world scenarios, and continuously improving in both intelligence and bedside manner. This is a hands-on, experimental role for ML engineers who care deeply about quality, safety, and user experience-and who thrive at the intersection of research and product.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed
Number of Employees
101-250 employees