Harvard T.H. Chan School Of Public Health - Cambridge, MA

posted about 1 month ago

Full-time - Senior
Hybrid - Cambridge, MA

About the position

As the Lead Data Scientist for Generative AI Products at Harvard Business School, you will spearhead the Data Science and Machine Learning team to develop innovative AI solutions that enhance engagement and education for a diverse range of stakeholders. This role involves translating stakeholder needs into user-facing applications utilizing NLP and large language models, while also guiding teams in ethical AI practices and advancing LLM applications. You will play a crucial role in architecting frameworks for GenAI products, optimizing model performance, and ensuring the safety and fairness of AI applications.

Responsibilities

  • Collaborate with the Data Science and Machine Learning team to create AI solutions for various stakeholders.
  • Translate cross-functional stakeholder needs into user-facing applications leveraging NLP techniques and LLMs.
  • Architect the overall framework and infrastructure for GenAI products like search interfaces and chatbots.
  • Develop and implement techniques to optimize model performance to meet specific product goals.
  • Guide engineering teams to effectively leverage LLM capabilities in product implementations.
  • Establish protocols for building fair, accountable, and transparent LLM-based applications.
  • Implement feedback pipelines and monitoring systems to ensure model safety.
  • Design and oversee the curation of high-quality datasets for LLM training.
  • Build data science pipelines from feature generation to model evaluation.
  • Communicate effectively with technical and non-technical audiences to foster understanding and engagement.

Requirements

  • Minimum of seven years' post-secondary education or relevant work experience.
  • Bachelor's/Advanced Degree in Mathematics, Physics, Computer Science, Engineering, Statistics, or 8+ years equivalent work experience.
  • 3-5 years of experience in developing machine learning models in a commercial environment.
  • Strong Python skills required.
  • Minimum of three years' experience building production NLP and deep learning models using PyTorch/Tensorflow.
  • Experience with large language model architectures (BERT, GPT-3, etc.).
  • Experience with production RAG pipelines and agentic information retrieval systems.
  • Proficiency with various prompting techniques and understanding of tradeoffs between prompting and finetuning.
  • Experience with cloud computing platforms - AWS.

Nice-to-haves

  • Proficiency in at least one open-source programming language (R, Java, C++ or another) and SQL.
  • Experience establishing model guardrails and developing bias detection techniques for AI applications.
  • Ability to mentor and lead others, providing hands-on technical guidance.
  • Experience working in agile methodology.

Benefits

  • Paid Time Off: 3-4 weeks of accrued vacation time per year, 12 accrued sick days, 12.5 holidays, and up to 12 weeks of paid leave for new parents.
  • Comprehensive medical, dental, and vision benefits, disability and life insurance programs.
  • Child and elder/adult care resources including on-campus childcare centers and Employee Assistance Program.
  • University-funded retirement plan with contributions from 5% to 15% of eligible compensation.
  • Tuition Assistance Program including $40 per class at the Harvard Extension School.
  • Professional Development programs and classes at little or no cost.
  • Various commuter options including discounted parking and public transportation passes.
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service