The Oracle Cloud Infrastructure (OCI) Compute team delivers bare metal servers and virtual machines, with both CPU and GPU offerings, at scale. Given the rapid growth in machine learning, the performance and efficiency of these cloud services are critical. The Core Architecture team identifies and addresses performance and efficiency constraints across the entire lifecycle of compute services, from inventory management and capacity ingestion to placement, repair, and decommissioning.

Consulting engineers perform in-depth analysis of business problems, then propose and incubate new automated solutions to meet the demands of Oracle's largest customers. This role leads the architectural definition of new host lifecycle management capabilities that will power the next generation of the Compute Control Plane. The initiative spans several Compute domains, such as GPU validation and repairs, and requires leading engineers across these organizations to develop cohesive, microservice-based solutions that let Compute scale with growing customer demand.

The ideal candidate is a hands-on senior engineer with broad technical expertise, proven experience solving cloud-scale problems, and extensive experience designing and implementing distributed systems, able to build the fault-tolerant solutions that will form the foundation of future Compute offerings. Strong written and verbal communication skills, the ability to lead projects across organizational boundaries, and experience presenting work to senior leaders are essential.
Job Type: Full-time
Career Level: Principal
Number of Employees: 5,001-10,000 employees