Data Science (Biomarkers) Intern

ProlaioChicago, IL
6h

About The Position

Prolaio believes that continuous learning and collaboration can make a significant difference in how heart care is administered. We are creating smarter ways to address heart disease and heart risks by integrating a connected platform enabled by smart data science to help patients access the care and attention that will inform better treatments and outcomes. We envision a future where care teams and hospitals can be more effective, the healthcare system can be more efficient, and patients have a better care experience and more fulfilling lives. This is precision cardiology, and we know it’s within reach. As a Data Science Intern on the Biomarkers Team, you will develop and validate advanced machine learning pipelines focused on long term time series analysis. Your primary objective is to leverage Foundational Time Series Models and Transfer Learning - such as adapting models from cardiac arrhythmia ECG detection to extract novel physiological insights from wearable and clinical sensor data. This role is essential for advancing our understanding of patient health by translating raw signal data into validated digital biomarkers.

Requirements

  • Academic Background: Currently enrolled in a Master’s or PhD program in Data Science, Electrical Engineering, Biomedical Engineering, Computer Science, or a related quantitative field.
  • Technical Proficiency: Strong proficiency in Python and deep learning frameworks (e.g., PyTorch or TensorFlow) with a specific focus on time series or signal data.
  • Signal Processing Expertise: Familiarity with digital signal processing (DSP) techniques, such as Fourier transforms, wavelet analysis, and windowing methods for physiological data.
  • Machine Learning Knowledge: Solid understanding of modern ML architectures (CNNs, RNNs, Transformers) and experience with Transfer Learning or fine-tuning large-scale models.
  • Data Handling: Experience working with real-world sensor data, handling irregular sampling rates, and managing large-scale longitudinal datasets.

Responsibilities

  • Model Development & Transfer Learning: Test existing and fine-tune deep learning architectures for time series data, specifically utilizing transfer learning techniques to adapt pre-trained ECG-based cardiac models for new physiological signal tasks.
  • Signal Processing Pipeline: Build and optimize Python-based signal processing workflows to handle noisy, real-world sensor data, including filtering, feature extraction, and artifact removal.
  • Foundational Model Implementation: Research and implement emerging foundational time series models to evaluate their zero-shot or few-shot performance on proprietary longitudinal cardiac datasets.
  • Validation & Benchmarking: Design rigorous validation frameworks comparing model outputs against a clinician-verified "ground truth" to establish metrics like Mean Absolute Error (MAE) and Intraclass Correlation (ICC).
  • Codebase Delivery: Maintain a clean, documented, and reproducible code repository that transforms raw high-frequency signals into structured, analysis-ready biomarker datasets.

Benefits

  • Impactful Work: You will join in the fight against heart failure (HF) and hypertrophic cardiomyopathy (HCM) with the goal of extending and saving the lives of our patients while also being at the forefront of changing the healthcare industry through technology.
  • Innovative Environment: You will be part of an organization doing something that’s never been done before.
  • Professional Growth: You will join a growing team and have a substantial impact on our daily and future operations with the opportunity to continuously learn and grow.
  • Collaborative Team: You will be part of a team of collaborative, curious, and committed individuals focused on the collective good, inclusiveness, scientific excellence, and advancing digital health for cardiology.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service