88-50101671 Principal Data Scientist

Roche · South San Francisco, CA
Posted 12d ago · $220,000 - $329,300 · Hybrid

About The Position

Genentech, Inc. seeks a Principal Data Scientist at its South San Francisco, CA location. Telecommuting is permitted up to 3 days per week.

Requirements

  • PhD in Biostatistics, Data Science, Mathematics, a Biological Science, or a related field and 3 years of industry experience in the job offered or as a Data Scientist or related position
  • 3 years of experience with the following in the healthcare or biotechnology industry:
      • Application of statistical modeling, machine learning, and exploratory and confirmatory data analysis to mid- to large-volume data sets
      • Analytic study design, including retrospective
      • Big data management
      • Python and R programming
      • Relational database principles and SQL
      • Conducting analyses with electronic health records
      • Data visualization techniques
      • Integrating data science products and LLM-based applications into scalable, robust end-user software systems
      • Developing and customizing Generative AI solutions for specific use cases such as text generation, information extraction, chatbots, and AI agents
      • Fine-tuning and optimization of Large Language Models (LLMs) to analyze unstructured and diverse data sources

Responsibilities

  • Apply statistical theory and methods to lead projects to design, develop, and program methods, processes, and systems to consolidate and analyze unstructured and diverse “big data” sources to generate actionable, persuasive information and insights and innovative solutions for client services and product enhancement.
  • Use statistical and visualization techniques to inform feature engineering, model selection, and optimization of LLM-based applications.
  • Design data pipelines to curate, preprocess, and structure datasets that improve the performance of LLM-based algorithms and reduce biases, with a focus on data quality and diversity.
  • Perform thorough data exploration to understand dataset characteristics, uncover patterns, detect biases, and identify data quality issues.
  • Lead research on scientific approaches and use state-of-the-art methodologies to analyze complex datasets and interpret the results of those analyses.
  • Develop and code software programs, algorithms, and automated processes to cleanse, integrate, and evaluate large datasets from multiple disparate sources.
  • Provide methodological and implementation guidance as well as hands-on support, and be accountable for the development and implementation of Data Science products.
  • Collaborate with AI engineers, product owners, business analysts, and other developers in Agile teams to integrate LLMs into scalable, robust, fair, and ethical end-user applications, focusing on user experience, relevance, and real-time performance.
  • Design, develop, customize, optimize, and fine-tune LLM-based and other AI-infused algorithms tailored to specific use cases such as text generation, summarization, information extraction, chatbots, AI agents, code generation, document analysis, sentiment analysis, and data analysis, among others.
  • Evaluate the pros and cons of different approaches and Generative AI platforms with comprehensive quantitative and qualitative analysis.
  • Collaborate within global Agile teams in the Informatics business and foundational domains to develop products that provide the highest value to both Pharma and Diagnostics business stakeholders.
  • Serve as a technical expert and resource for the team and clients, contributing to building and cultivating a data-driven decision-making culture.

Benefits

  • A discretionary annual bonus may be available based on individual and Company performance.
  • This position also qualifies for the benefits detailed at the link provided below.
  • Benefits (https://roche.ehr.com/default.ashx?CLASSNAME=splash)

What This Job Offers

  • Job Type: Full-time
  • Career Level: Principal
  • Education Level: Ph.D. or professional degree
