Machine Learning Engineer - Inference

Together AISan Francisco, CA
51d$160,000 - $230,000

About The Position

Together AI is seeking a Machine Learning Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference systems. This role involves working with state-of-the-art large language models models and ensuring they run efficiently and effectively at scale. If you are passionate about AI inference, PyTorch, and developing high-performance systems, we want to hear from you. This position offers the chance to collaborate closely with AI researchers and engineers to create cutting-edge AI solutions. Join us in shaping the future at Together AI!

Requirements

  • 3+ years of experience writing high-performance, well-tested, production-quality code.
  • Proficiency with Python and PyTorch.
  • Demonstrated experience in building high performance libraries and tooling.
  • Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale.

Nice To Haves

  • Knowledge of existing AI inference systems such as TGI, vLLM, TensorRT-LLM, Optimum
  • Knowledge of AI inference techniques such as speculative decoding.
  • Knowledge of CUDA/Triton programming.
  • Knowledge of Rust, Cython and compilers.

Responsibilities

  • Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale.
  • Develop and optimize runtime inference services for large-scale AI applications.
  • Collaborate with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world.
  • Conduct design and code reviews to ensure high standards of quality.
  • Create services, tools, and developer documentation to support the inference engine.
  • Implement robust and fault-tolerant systems for data ingestion and processing.

Benefits

  • We offer competitive compensation, startup equity, health insurance, and other competitive benefits.
  • The US base salary range for this full-time position is $160,000 - $230,000 + equity + benefits.
  • Our salary ranges are determined by location, level, and role.
  • Individual compensation will be determined by experience, skills, and job-related knowledge.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service