Staff Software Engineer, Generative AI Inference

GoogleSeattle, WA
83d$197,000 - $291,000

About The Position

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward. We believe Generative AI (GenAI) inference will revolutionize our industry and we aim to make GKE the go-to platform for deploying these workloads. Large GenAI models present scaling and usability challenges with accelerators compared to traditional CPU workloads. As the creators of Kubernetes and with Google's extensive AI experience, we believe in GKE's ability to innovate in this space. We are looking for an ambitious, execution-oriented Software Engineers to help us make GKE the leading cost-effective, simplified, and fastest platform for running GenAI inference workloads. As a Software Engineer, you will joining the Inference Workload team, responsible for Gen AI Inference features, reliability and operations as well as simplifying and improving GenAI inference workload onboarding.

Requirements

  • Experience in software engineering with a focus on large-scale system design.
  • Knowledge of information retrieval, distributed computing, and networking.
  • Familiarity with artificial intelligence and natural language processing.
  • Ability to work on full-stack development.

Nice To Haves

  • Experience with Kubernetes and Google Cloud technologies.
  • Background in developing cost-effective and simplified platforms for AI workloads.

Responsibilities

  • Work on specific projects critical to Google's needs.
  • Switch teams and projects as the business evolves.
  • Display leadership qualities and take on new problems across the full-stack.
  • Contribute to the development of Generative AI inference features.
  • Ensure reliability and operations of GenAI inference workloads.
  • Simplify and improve GenAI inference workload onboarding.

Benefits

  • Base salary range of $197,000-$291,000.
  • Bonus and equity options.
  • Comprehensive benefits package.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Industry

Web Search Portals, Libraries, Archives, and Other Information Services

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service