Rakuten-posted 2 days ago
Full-time • Mid Level
Boston, MA
5,001-10,000 employees

Rakuten International is a division of Rakuten Group, Inc., a Japanese global technology leader in services that empower individuals, communities, businesses and society. Headquartered in San Mateo, California with more than 4,000 employees worldwide, the Rakuten International business portfolio includes market leaders in e-commerce, digital marketing, advertising, communications and entertainment. We create products and services that provide exceptional value by aligning members and the businesses that want to engage them in a shared community. Rakuten Institute of Technology (RIT) is the corporate R&D department of the Rakuten group and is responsible for leading strategic technological progress. RIT’s targets vary from basic technologies to innovative new services. RIT consists of highly motivated research scientists located in Tokyo, Boston, Paris, Singapore, San Mateo and Bengaluru who are driven by strong curiosity. We are seeking an innovative and dedicated r esearch s cientist to join RIT. As a n Associate Research Scientist, you will play a pivotal role in shaping the future of AI within Rakuten's diverse ecosystem. Our team focuses on enhancing in-house Large Language Models (LLMs) and seamlessly integrating them into Rakuten's services, impacting millions of users worldwide. We are a collaborative and innovative group, working on cutting-edge research and fostering a culture of knowledge sharing. The ideal candidate will have a passion for producing groundbreaking research and collaborat ing with teams across Rakuten to deliver impactful AI solutions. We welcome exceptional researchers and engineers from other fields who may not have direct experience in generative AI to apply. This role provides an opportunity to gain the critical skills and knowledge needed to transition into the AI & ML space. We are committed to supporting career changers who bring unique perspectives and expertise, and we offer mentorship and resources to facilitate your growth in the AI and ML domains.

  • Develop and train LLMs, including data processing, pre-training, and fine-tuning. This also includes designing and running experiments, writing code, and evaluating results.
  • Implement and verify latest technologies in Deep Learning, LLMs and AI agent fields, and reflect the implementation into in-house machine learning libraries.
  • Work collaboratively with globally distributed people in a range of roles, including researchers, engineers, product managers, designers, and other key product stakeholders to accomplish complex tasks that deliver value to our business.
  • Present findings and insights in team meetings and contributing to project documentation.
  • Stay up to date on research and product advances in AI.
  • Proven experience in a research-focused role within industry, academia, or government institutions.
  • Experience participating in machine learning projects , having completed the entire cycle of model training , experimental design , and evaluation .
  • Basic knowledge and understanding of natural language processing , deep learning and generative AI, including LLMs , RAG, and AI agents , and a passion for staying at the forefront of these technologies .
  • High- level coding skills in Python, with a strong understanding of software engineering principles .
  • Experience in pre-training / fine-tuning existing LLMs (> 1B) using PyTorch .
  • Strong communication skills to understand and effectively convey technical information .
  • A strong emphasis on teamwork and a proactive attitude towards learning.
  • Masters in Computer Science , Machine Learning or related field with 2 + years of experience in relevant industry experience.
  • Ability to speak, write, and communicate in Japanese is a plus
  • Experience presenting as first author at top-tier peer-reviewed conferences.
  • Familiarity with efficient training and inference techniques such as LoRA , pruning, knowledge distillation, and mixed precision training.
  • Experience with distributed training frameworks such as DeepSpeed and FSDP.
  • Experience with multilingual and multimodal modeling.
  • Experience in developing software to process a large-scale dataset using a distributed computing framework (e.g., Hadoop)
  • PhD in in Computer Science, Machine Learning or related field preferred
  • health, vision, dental insurance
  • 401k matching
  • PTO
  • Volunteer Time Off (VTO)
  • discretionary bonus
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service