Data Scientist

RochePleasanton, CA
1d

About The Position

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters. The Position As a Data Scientist you will have a strong foundation in machine learning (ML), data science, and software engineering. You will have practical experience in building and deploying ML models and developing AI agents, particularly for tasks involving unstructured/structured data and workflow automation.

Requirements

  • Proficient in a wide range of ML algorithms, from traditional models like linear regression and decision trees to more advanced deep learning architectures such as Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).
  • Understand the principles behind model training, validation, and hyperparameter tuning.
  • For extracting information from unstructured text, strong NLP skills are essential.
  • Experience with techniques like tokenization, sentiment analysis, named entity recognition, topic modeling, and using pre-trained language models like BERT, GPT, or others from the Hugging Face ecosystem.
  • Adept at working with various data formats and have experience in data cleaning, preprocessing, and transforming raw data into useful features for ML models.
  • Handling missing values, encoding categorical data, and scaling numerical features.
  • Proficiency in Python is a must, along with a solid understanding of key libraries like Scikit-learn, Pandas, TensorFlow, and PyTorch.
  • Experience with MLOps (Machine Learning Operations) practices, including model versioning, monitoring, and deployment on cloud platforms (AWS, Azure, or GCP), is crucial for building and maintaining robust solutions.
  • Understand the components of an AI agent, including a Large Language Model (LLM) as the brain, tools for specific tasks, and a logical structure for decision-making.
  • Practical experience in designing and implementing automated workflows.
  • Integrating AI agents and ML models into existing business processes.
  • Able to identify bottlenecks, map out a solution, and build the necessary connectors or APIs to execute tasks automatically.
  • Demonstrate expertise in handling various forms of unstructured data, including text, images, and audio.
  • Building pipelines to ingest, process, and analyze this data to extract meaningful insights or trigger actions.
  • Problem-Solving: The ability to break down complex business problems into manageable, data-driven solutions is key.
  • Able to think critically and creatively to solve real-world challenges.
  • Communication: A great candidate can clearly articulate technical concepts to non-technical stakeholders, explaining the "why" and "how" of their solutions.
  • Vital for collaborating with different teams and ensuring the project meets business goals.
  • Business Acumen: The best candidates understand the business context of their work.
  • Able to connect their technical solutions directly to a positive impact on the company's bottom line or operational efficiency.

Responsibilities

  • Machine Learning and Deep Learning
  • Natural Language Processing (NLP)
  • Data Handling and Feature Engineering
  • Programming and MLOps
  • AI Agent Architectures
  • Workflow Automation
  • Unstructured Data
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service