Staff Data Scientist

IdexxWestbrook, ID
$155,000 - $165,000

About The Position

PRIMARY DUTIES AND RESPONSIBILITIES: Research, design, implement and validate cutting-edge algorithms using new statistical or other mathematical methodologies to analyze diverse sources of data to achieve targeted outcomes. Develop, enhance and maintain advanced analytic models that generate new business insights or deliver predictive services to applications/teams within IDEXX. Perform predictive modeling using machine learning techniques including aspects of data sourcing, data cleaning, data transformation, feature extraction/generation, model tuning, model selection, model evaluation and production deployment. Collaborate with product management and engineering departments to understand company needs and devise possible solutions . Work with cross-functional team members to identify and prioritize actionable, high-impact data insights across a variety of core business areas. Keep up to date with latest trends in machine learning, artificial intelligence, big data mining and solutions. Acts as internal consultant, advocate, mentor and change agent. Collaborate and communicate with software developers in the translation of research model to production code. Translate and effectively communicate complex concepts to a broad audience. Collaborate and communicate with data integration teams when data quality issues are discovered. Communicate results and ideas to key decision makers. Maintains appropriate knowledge of clinical systems used across veterinary healthcare industry. EDUCATION: Master's Degree or equivalent combination of education and experience . PhD preferred. REQUIRED SKILLS AND ABILITIES: Proven experience with data formats, structures and common methods in data transformation (CSV, XLS, JSON, Avro, Parquet as well as in relational and NoSQL databases such as MySQL, Oracle and HBase); data sources (cloud systems, AWS, MySQL, Oracle); SQL and data manipulation; big data tools (Spark, Hadoop). Experience using core data science tools such as R, SAS, Python, Matlab , Java; boutique machine learning tools (TensorFlow, Theano, H2O Machine Learning). Strong statistical foundation with broad knowledge of deterministic and probabilistic statistical methods. Excellent pattern recognition and predictive modeling skills. Thorough data exploration in advance of predictive modeling. Technical knowledge of distributed computing platforms and common data process flows. Classification – decision trees, logistic regression, random forest, SVM, neural networks Regression – linear, nonlinear, boosted decision trees Clustering – K-means, hierarchical, mixture modeling, anomaly detection Time series – ARIMA Dimensionality reduction – PCA, SVD Experience in health care domain preferred Natural curiosity to research and identify possible quantitative solutions to common business problems. A strong business orientation, able to select the appropriate complex quantitative methodologies in response to specific business goals. Strong communication skills, both verbal and written, including ability to translate technical subject matter to non-technical audiences (both as a speaker and listener). Fluency in the English language. PHYSICAL DEMANDS: Extensive sitting, phone and computer use. Some travel may be . WORK ENVIRONMENT: General office environment. Normal office noise level, with occasional moderate noise. LEVELING GUIDE: Technically Experienced Designs and builds new data models utilizing off-the-shelf machine learning algorithms and tools. Works efficiently and independently along the whole data science pipeline – acquisition, exploration, cleaning, modeling and evaluation. Experience with various data structures and common methods in data transformation. Excellent pattern recognition and predictive modeling skills. Ability to create new tools and packages for data science use in statistical/data science programming environments such as R, SAS, Python/Pandas. Ability to create new complex SQL data queries Focused Experience with doing data sourcing, manipulation, analysis on Hadoop and/or Spark platform. Skilled Core Competencies (building on Data Scientist) Think Big – demonstrates strategic mindset and cultivates innovation. Ensures Impact – talented at planning and driving results. Works Across Organization – Skilled capabilities in building informal and formal networks, with strong persuasion skills. Brings People with you – Capable of developing talent to meet both their career and organizational goals.

Requirements

  • Proven experience with data formats, structures and common methods in data transformation (CSV, XLS, JSON, Avro, Parquet as well as in relational and NoSQL databases such as MySQL, Oracle and HBase); data sources (cloud systems, AWS, MySQL, Oracle); SQL and data manipulation; big data tools (Spark, Hadoop).
  • Experience using core data science tools such as R, SAS, Python, Matlab , Java; boutique machine learning tools (TensorFlow, Theano, H2O Machine Learning).
  • Strong statistical foundation with broad knowledge of deterministic and probabilistic statistical methods.
  • Excellent pattern recognition and predictive modeling skills.
  • Thorough data exploration in advance of predictive modeling.
  • Technical knowledge of distributed computing platforms and common data process flows.
  • Classification – decision trees, logistic regression, random forest, SVM, neural networks Regression – linear, nonlinear, boosted decision trees Clustering – K-means, hierarchical, mixture modeling, anomaly detection Time series – ARIMA Dimensionality reduction – PCA, SVD
  • Natural curiosity to research and identify possible quantitative solutions to common business problems.
  • A strong business orientation, able to select the appropriate complex quantitative methodologies in response to specific business goals.
  • Strong communication skills, both verbal and written, including ability to translate technical subject matter to non-technical audiences (both as a speaker and listener).
  • Fluency in the English language.
  • Extensive sitting, phone and computer use.

Nice To Haves

  • PhD preferred.
  • Experience in health care domain preferred

Responsibilities

  • Research, design, implement and validate cutting-edge algorithms using new statistical or other mathematical methodologies to analyze diverse sources of data to achieve targeted outcomes.
  • Develop, enhance and maintain advanced analytic models that generate new business insights or deliver predictive services to applications/teams within IDEXX.
  • Perform predictive modeling using machine learning techniques including aspects of data sourcing, data cleaning, data transformation, feature extraction/generation, model tuning, model selection, model evaluation and production deployment.
  • Collaborate with product management and engineering departments to understand company needs and devise possible solutions .
  • Work with cross-functional team members to identify and prioritize actionable, high-impact data insights across a variety of core business areas.
  • Keep up to date with latest trends in machine learning, artificial intelligence, big data mining and solutions.
  • Acts as internal consultant, advocate, mentor and change agent.
  • Collaborate and communicate with software developers in the translation of research model to production code.
  • Translate and effectively communicate complex concepts to a broad audience.
  • Collaborate and communicate with data integration teams when data quality issues are discovered.
  • Communicate results and ideas to key decision makers.
  • Maintains appropriate knowledge of clinical systems used across veterinary healthcare industry.

Benefits

  • Additional benefits include pet insurance, mental health resources, volunteer paid days off, employee stock program, foundation donation matching, and more!
  • 5% matching 401k
  • Health / Dental / Vision Benefits
  • Day-One Opportunity for annual cash bonus
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service