Data Scientist III

RemitlyPhiladelphia, PA
1d$110,487 - $115,600Remote

About The Position

Support our Data Sciences team within Health Content Operations and work with the Clinical Solutions and Education business units to provide research around data science and analytics that drives growth, revenue generation, outreach, and innovation. Focus on solving data science problems such as entity extraction, named-entity recognition, word-sense disambiguation, information retrieval, clustering, supervised and unsupervised learning using NLP, machine learning, deep learning, and statistical methods. Compare and recommend the use of latest GenAI technologies to solve these problems when the traditional approaches cannot solve them. Drive innovation by leveraging the latest research, literature, latest developments in GenAI, Responsible AI, GenAI Evaluation, RAG to Build POC’s and solve complex problems. Collaborate with inter disciplinary teams across organization to build solutions. Conduct research on concept indexing, relationship extraction, and data extraction from clinical data and scientific literature. Analyze vast amounts of unstructured data and design, prototype, and operationalize machine learning and automation solutions for our health business. Provide data analytics support including designing automated approaches for ontology and graph development, ontology validation and terminology mappings. Analyze extracted information to drive such processes as automated and manual data cleansing, linking, and populating knowledge graphs. Coordinate with stakeholders as needed. Manage contractors as needed to get new entities reviewed for ingestion into Emmet. Ensure compliance with DPR documentation. Lead the Rising Tide program and Mentor interns. Coordinate with IT developers and (content) subject matters experts to translate information needs into data science solutions. Drive new developments and implement process changes and disruptive technologies in the organization. Perform other duties as needed.

Requirements

  • Master’s degree (or foreign equivalent) in Data Science, Data Analytics, Enterprise Intelligence, or a related field required.
  • 2 years of experience in job offered or related occupations required.
  • 2 years of experience: with doctor, nurse, and patient information needs to design Data Science, Machine Learning (ML) and Natural Language Processing (NLP) solutions to improve patient outcomes; working with deep learning models, neural networks, and state-of-the-art transformer language models, putting data science into production; utilizing nix systems, open-source software, Jupyter notebook hubs, cloud computing, MATLAB for data modeling, machine learning and visualization purposes, and Java, Python or R; and with utilization of database, data manipulation, and visualization tools, such as MySQL, Excel, and Tableau.

Responsibilities

  • Solving data science problems such as entity extraction, named-entity recognition, word-sense disambiguation, information retrieval, clustering, supervised and unsupervised learning using NLP, machine learning, deep learning, and statistical methods.
  • Compare and recommend the use of latest GenAI technologies to solve these problems when the traditional approaches cannot solve them.
  • Drive innovation by leveraging the latest research, literature, latest developments in GenAI, Responsible AI, GenAI Evaluation, RAG to Build POC’s and solve complex problems.
  • Collaborate with inter disciplinary teams across organization to build solutions.
  • Conduct research on concept indexing, relationship extraction, and data extraction from clinical data and scientific literature.
  • Analyze vast amounts of unstructured data and design, prototype, and operationalize machine learning and automation solutions for our health business.
  • Provide data analytics support including designing automated approaches for ontology and graph development, ontology validation and terminology mappings.
  • Analyze extracted information to drive such processes as automated and manual data cleansing, linking, and populating knowledge graphs.
  • Coordinate with stakeholders as needed.
  • Manage contractors as needed to get new entities reviewed for ingestion into Emmet.
  • Ensure compliance with DPR documentation.
  • Lead the Rising Tide program and Mentor interns.
  • Coordinate with IT developers and (content) subject matters experts to translate information needs into data science solutions.
  • Drive new developments and implement process changes and disruptive technologies in the organization.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service