Data Scientist II

Children’s Hospital of PhiladelphiaPhiladelphia, PA
3d

About The Position

The Data Science and Biostatistics Unit (DSBU) within the Department of Biomedical and Health Informatics (DBHi) collaborates with researchers across CHOP’s Research Institute to provide data management, analysis and consultation on methods development, study design, and data analysis. This interdisciplinary team consists of biostatisticians, epidemiologists, data scientists, data analysts and other methodologists. Staff have expertise managing and using diverse multimodal healthcare data from varied data sources and registries associated with electronic health records, clinical trials, administrative, claims, and surveys. DSBU staff have experience across various machine learning and statistical methodologies, including supervised & unsupervised learning, NLP, computer vision, propensity score matching, multivariate modeling, latent variable mixture models, and geographic information systems. In order to support DSBU’s rapid growth in meeting the analysis needs of CHOP’s research teams, we are recruiting a Data Scientist II. This individual will provide consistently high-quality deliverables within reasonable timeframes and will collaboratively work with principal investigators from the Research Institute to achieve meaningful results and academic output. The ideal individual will work assist in tasks using computer vision and/or natural language processing to answer questions involving image and video healthcare data and unstructured text data. The Data Scientist II will implement selected methodologies, perform data manipulation, data management, and modeling using machine learning and statistical methods and report to a Data Scientist Supervisor within the DSBU.

Requirements

  • Bachelor's Degree - Required
  • At least three (3) years experience with progressively more complex data science, applied statistics, machine learning, or mathematical modeling projects - Required
  • Experience and demonstrated ability acquiring new technical/analytic skills and domain knowledge to support successful contribution to research and development projects is required.
  • Experience formulating or contributing to the formulation of analysis plans and selection of appropriate methods.
  • Experience using existing machine learning and analytic tools such as ScikitLearn, Weka, R, and Mathematica in either applied educational or professional projects is required.
  • Experience writing code in either applied educational or professional projects using one or more of the following languages: Python, Scala, Java is required.
  • Familiarity with relational databases (e.g. Postgres, MySQL) strongly preferred.
  • Computer vision and natural language processing (NLP) experience, particularly in the biological and medical domains, is strongly preferred
  • Strong verbal and written communications skills with the demonstrated ability to explain complex technical concepts to a lay audience.

Nice To Haves

  • Bachelor's Degree Analytics, Data Science, Statistics, Mathematics, Computer Science or a related field - Preferred
  • At least four (4) years with progressively more complex data science, applied statistics, machine learning, or mathematical modeling projects - Preferred
  • Applied statistics or mathematical modeling experience preferred.
  • Natural language processing experience particularly in the biological and medical domains preferred.
  • Experience using distributed computing technologies (e.g. Akka, MapReduce, Cuda) preferred.
  • Familiarity with graph, key value, and document data stores (e.g. Neo4j, Hadoop, MongoDB) preferred.
  • Experience creating informative visualizations for complex, high dimensional data preferred.
  • Experience with probabilistic graphical models, time series predictive models, Markov models preferred.

Responsibilities

  • Implement computational algorithms and experiments for test and evaluation; interprets data to assess algorithm performance.
  • Make significant contributions to the formulation of analysis plans and associated documentation of methods that meet stringent criteria for reproducibility and measures of significance.
  • Develop high-quality code implementing models and algorithms as application programming interfaces or other service-oriented software implementations.
  • Participate in communication of research methods, implementation, and results to varied audience of clinicians, scientists, analysts, and programmers.
  • Work closely with applications research group to translate models and algorithms into engineered production applications.
  • Contribute to manuscript writing for results publication, authors abstracts, and presents at professional conferences.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service