Data Analyst

Spectral MD Inc, Dallas, TX

About The Position

The Data Analyst will take ownership of analyzing and improving the data pipeline that supports model training and implementation, with the goal of enhancing model performance and reliability. In this role, the analyst will work closely with the Data Science team to validate data inputs and outputs and report on data quality, while also serving as a key liaison between the data science and statistics teams. This includes supporting the transition of model and data workflows to a statistics-led framework and helping statisticians understand data processes and workflow requirements. By conducting in-depth analysis, supporting data pipeline improvements, and collaborating with both engineering and statistical teams, the Data Analyst will help scale data capabilities across the organization. This role requires strong analytical skills, initiative, and the ability to communicate technical workflows clearly to cross-functional partners.

Requirements

  • Master’s degree in Computer Science, Engineering, Information Technology, Statistics, or a related field.
  • Experience with R and with relational (SQL) and NoSQL databases.
  • Experience with programming in Python.
  • Excellent verbal and written communication skills.
  • Strong attention to detail and organizational skills.
  • Ability to work independently, under supervision, and in a creative environment.

Nice To Haves

  • 2+ years of experience as a data analyst, business analyst or in a similar role.

Responsibilities

  • Analyze the data across the entire pipeline used for our AI model training and implementation, and provide analysis reports.
  • Validate the inputs and outputs of each phase in the data pipeline, and diagnose and fix data accuracy and data flow issues.
  • Assemble, aggregate, clean, and transform large, complex, and diverse data sets for analysis.
  • Support the management of events or projects that use the data to deliver results, e.g., our truthing event.
  • Help improve existing data pipelines using Python, R, SQL / NoSQL, and AWS.
  • Work closely with data scientists to define and map data requirements that can be translated into executable data processing pipelines.
  • Facilitate the transition of data workflows and deliverables from the data science team to statisticians for downstream analysis.
  • Collaborate with statisticians to explain data generation, preprocessing, and model workflows, ensuring smooth handoffs and shared understanding.
  • Help build documentation and workflow guides to support cross-functional data use.