About The Position

At Oracle Cloud Infrastructure (OCI), we are redefining the future of computing for enterprises—building cloud-native systems from the ground up, powered by a global team of visionary engineers, scientists, and creators. We combine the agility of a startup with the scale, security, and reach of Oracle’s enterprise-grade platforms. Our Generative AI Service team is pioneering the development of infrastructure and services that harness the transformative power of Large Language Models (LLMs) and Agentic AI systems. Our mission is to build world-class, scalable platforms that enable customers to deploy intelligent agents and applications, deeply integrated with OCI’s robust cloud ecosystem. As a Consulting Member of Technical Staff (IC5), you will play a pivotal role in designing, building, and optimizing LLM infrastructure, agent execution runtimes, and next-generation developer platforms. You'll collaborate closely with applied scientists and ML engineers to bring agentic workflows into real-world deployments—at scale. This is a hands-on technical leadership role, ideal for someone deeply rooted in distributed systems and low-level computer science.

Requirements

  • BS in Computer Science or equivalent experience.
  • 10+ years of experience in production-grade distributed systems and cloud-native software engineering.
  • Proficiency in Go, Java, Python, or C++.
  • Expertise in high-performance computing and ML model serving infrastructure.
  • Deep understanding of container orchestration and CI/CD pipelines.
  • Strong communication skills and experience mentoring across teams.

Nice To Haves

  • MS or PhD in Computer Science, particularly in Systems, ML Infrastructure, or Compilers.
  • Experience with LLM serving frameworks like vLLM, FasterTransformer, DeepSpeed, or Triton.
  • Familiarity with agent-based systems.
  • Contributions to LLM-native developer tools and compiler IRs.
  • Experience with vector databases, tool APIs, and event-driven workflows.
  • Foundation in OS internals, compiler pipelines, and systems programming.
  • Proven ability to lead large-scale architecture efforts.

Responsibilities

  • Design, build, and optimize LLM infrastructure and agent execution runtimes.
  • Collaborate with applied scientists and ML engineers to implement agentic workflows.
  • Lead technical initiatives in distributed systems and cloud-native software engineering.

Benefits

  • Be at the frontier of generative AI and agent-based software interaction.
  • Work on mission-critical projects impacting Oracle’s AI strategy.
  • Collaborate with a globally distributed team of leading engineers and researchers.
  • Enjoy the agility of a fast-moving team with enterprise-level resources.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Senior

Education Level

Bachelor's degree

Number of Employees

5,001-10,000 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service