Serve as the technical authority for AI quality, evaluation, and risk controls within the Commercial AI Center of Excellence (CAI CoE). Define and operationalize technical mechanisms that ensure Vertex AI systems meet tax-grade standards for accuracy, defensibility, and compliance. Design and implement AI evaluation frameworks, including gold datasets, regression testing, and CI gates; define metrics across accuracy, confidence calibration, citation coverage, and abstention correctness. Lead model validation, error analysis, telemetry, and drift detection strategies; establish best practices for GenAI and ML hybrid systems. Partner with Legal, Security, and Risk teams to translate policy into enforceable technical mechanisms; review high-risk AI use cases and guide product teams on mitigation strategies. Influence standards and practices across large engineering organizations while partnering with product teams to improve model performance, reliability, and adoption.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Senior
Education Level
No Education Listed