Senior Data Scientist

CertiKNew York, NY

About The Position

The primary responsibility of this role is to build/maintain ETL pipelines & process large datasets from APIs/databases/third-party platforms to enable real-time team analytics and automate data preprocessing (cleaning/normalization/validation) for client accounts using rule-based logic/statistical checks to ensure data quality & prepare analysis-ready datasets for modeling/reporting. Analyze large-scale blockchain/transactional/social-media datasets to identify patterns/trends/anomalies/risk indicators. Develop/apply machine learning models (graph-based algorithms & NLP techniques) for threat detection/behavioral analysis/monitoring. Perform feature engineering/model training/testing/validation to ensure accuracy/robustness/interpretability. Design/implement scalable data pipelines/ETL processes & CI/CD workflows for ingestion/preprocessing/aggregating blockchain & social media data. Create dashboards/visualizations to deliver actionable insights & provide data-driven guidance for strategic planning. Collaborate with engineering/product/business teams to translate analytical requirements into scalable data-science solutions.

Requirements

  • Master’s degree in Data Science, Statistics, or a related field
  • Sound knowledge of feature engineering/model evaluation/validation & on-chain patterns/risk-analysis/threat-detection methodologies
  • In-depth understanding of blockchain/distributed ledger data structures & analytics
  • Strong ability to apply machine-learning & statistical modeling techniques to large-scale datasets
  • Expertise in analyzing graph/text-based or transactional data
  • Familiar with cloud platforms (AWS/Azure/GCP) & Spark-based distributed-computing systems (e.g., Databricks)
  • Proficient in Python, SQL (PostgreSQL/MySQL/NoSQL) & ETL tools (Apache Airflow)

Responsibilities

  • Build/maintain ETL pipelines & process large datasets from APIs/databases/third-party platforms to enable real-time team analytics
  • Automate data preprocessing (cleaning/normalization/validation) for client accounts using rule-based logic/statistical checks to ensure data quality & prepare analysis-ready datasets for modeling/reporting
  • Analyze large-scale blockchain/transactional/social-media datasets to identify patterns/trends/anomalies/risk indicators
  • Develop/apply machine learning models (graph-based algorithms & NLP techniques) for threat detection/behavioral analysis/monitoring
  • Perform feature engineering/model training/testing/validation to ensure accuracy/robustness/interpretability
  • Design/implement scalable data pipelines/ETL processes & CI/CD workflows for ingestion/preprocessing/aggregating blockchain & social media data
  • Create dashboards/visualizations to deliver actionable insights & provide data-driven guidance for strategic planning
  • Collaborate with engineering/product/business teams to translate analytical requirements into scalable data-science solutions

Benefits

  • medical insurance
  • vision insurance
  • dental insurance
  • 401(k) plan with company matching
  • life and accidental death and dismemberment insurance
  • HSA (with high deductible plan)
  • FSA
  • flexible paid time off
  • holidays
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service