Oracle-posted 3 months ago
Senior
Nashville, TN
5,001-10,000 employees

Join us in shaping the future of Kubernetes in the cloud with OKE. This is a unique opportunity to influence the direction of Kubernetes at OCI while innovating and advancing solutions in the AI infrastructure space. We’re looking for hands-on engineers with expertise in and a passion for solving difficult problems in distributed systems, virtualized infrastructure, and highly available services. If this is you, at Oracle you can design and build innovative new systems from the ground up. These are exciting times in our space: we are growing fast, still at an early stage, and working on ambitious new initiatives. An engineer at any level can have a significant technical and business impact.

The ideal candidate for this team is an experienced architect and proficient programmer with a wide breadth of knowledge and experience, including areas such as networking, storage, internet protocols, and operating systems. We write distributed, highly available systems to build, update, and deploy Kubernetes, plus automation and tooling for testing, deployments, and other needs.

  • Design, develop, troubleshoot, and debug software programs for databases, applications, tools, networks, etc.
  • Build and manage Kubernetes infrastructure tailored for extreme scalability.
  • Maximize operational efficiency and create innovative solutions for large-scale AI training and inference workloads.
  • Develop sophisticated tools and integrations for deploying and managing complex Kubernetes environments.
  • BS degree in Computer Science or related technical field involving coding or equivalent practical experience.
  • 8+ years of experience delivering and operating large-scale, highly available distributed systems.
  • 8+ years of working in large Java or Golang codebases.
  • Strong knowledge of data structures, algorithms, operating systems, and distributed systems.
  • Systematic problem-solving approach, strong communication skills, a sense of ownership, and drive.
  • Experience building large-scale, multi-tenant, virtualized infrastructure.
  • Experience developing or managing containerized workloads using Kubernetes.
  • Experience in container networking or GPU AI/ML workloads and RDMA clusters.
  • Experience with scripting languages such as Python or Perl.