Oracle Health is seeking an AI Platform Reliability Engineer to ensure our AI agent platform and AI-enabled analytics workflows are reliable, observable, measurable, and safe in production. This role will focus on the operational foundation for production AI systems, including monitoring, tracing, evaluation in production, rollback controls, alerting, versioning, runtime diagnostics, and quality safeguards. The engineer will also support data reliability use cases such as detection of stopped processing, data gaps, freshness issues, schema drift, and anomaly conditions that affect downstream analytics and reporting. The ideal candidate brings strong engineering discipline in observability, release safety, and operational tooling, with the ability to apply those skills to modern AI and agent-based systems. This role is critical to maintaining trust in AI outputs and ensuring new capabilities can scale safely across Oracle Health.
Job Type: Full-time
Career Level: Senior
Education Level: No Education Listed
Number of Employees: 5,001-10,000 employees