top of page

Causal functional effects

NSF_illustration_minimal.png

Smart wearable technology has vast potential to transform healthcare, yet their social and economic value remains largely untapped due to the inability of conventional analysis methods to handle high temporal complexity induced by their functional nature. The fidelity of data
varies across individuals due to differences in hardware, charging practices, and usage variability. By combining the strength of functional data analysis (FDA) with modern machine learning techniques—termed functional data learning (FDL)—the analysis and interpretation of high-dimensional wearable data are enhanced, providing deeper insights into health trends.


Current work:
Kernel-based estimators for functional causal effects, Journal of Machine Learning Research (under review)

Digital health monitoring 

Passive_monitoring_2020.png

Not all walking is the same. Changes in the way we walk, talk or move, can be indicative of our health or disease progression. Traditional medical examination provides only a snippet in ones health. Wearable devices can be used to measure how our walking, voice or quality of life changes continiously and this can empower medical experts with a newfound understanding of a range of conditions such as Parkinson's disease. However, deriving robust and meaningful digital biomarkers for Parkinson's is difficult open problem.

Published work:

  • Quantifying arm swing in Parkinson's disease: a method accounting for arm activities during free-living gait, Journal of Neuroengineering and Rehabilitation

  • Passive Monitoring of Parkinson Tremor in Daily Life: A Prototypical Network Approach, Sensors

  • Automatic quality control and enhancement for voice-based remote Parkinson’s disease detection, Speech Communication

  • Probabilistic modelling of gait for robust passive monitoring in daily life, IEEE Journal of Biomedical and Health Informatics

  • Real Life Gait Performance as Biomarker of Motor fluctuations, Journal of medical Internet research

  • Probabilistic modelling for unsupervised analysis of human behaviour in smart cities, Sensors

  • Quality control of voice recordings in remote Parkinson’s disease monitoring using the infinite hidden Markov model, ICASSP

  • Automated quality control for sensor-based symptom measurement performed outside the lab, Sensors​

In peer review:

  • A generalizable and open-source algorithm for real-life monitoring of tremor in Parkinson’s disease, npj Parkinson's disease

  • Leveraging Wearable Sensors for Real-World Monitoring and Medication Response Evaluation of Parkinson’s tremor, npj Parkinson's disease

  • Heart rate monitoring using wrist photoplethysmography in Parkinson’s disease:
    feasibility and relation with autonomic dysfunction, npj Parkinson's disease

  • Few-shot time series segmentation using prototype-defined infinite hidden Markov models 

Media coverage on Parkinson's monitoring:

​

What to do when K-means fails? 

K_MEANS_2.png

Despite the plethora of more flexible approaches to clustering, simple techniques such as variants of K-means remain  widely used due to their scalability and interpretable behaviour. Exploring links betweek K-means and graphical models, we have developed flexible alternatives of K-means which address many of its problems related to: (1) setting the number of clusters; (2) initialization (3)  cluster sphericity and density; (4) scalability. We also develop hierarchical extensions related to clustering across different contexts. 

Published work:

  • What to Do When K-Means Clustering Fails: A Simple yet Principled Alternative Algorithm, PLoS One

  • Simple approximate MAP inference for Dirichlet processes mixtures, Electronic Journal of Statistics

  • A deterministic inference framework for discrete nonparametric latent variable models: learning complex probabilistic models with simple algorithms, PhD Thesis​

Finding shared factors and subspaces in high dimensional data

aPPCA_illustration.png

Doing inference in high-dimensional spaces is one of the core challenges in statistical machine learning. A workhorse in addressing this challenge are linear dimensionality techniques such as principal component analysis and factor analysis. Here we look at principled piecewise linear methods to dimensionality reduction which achieve both sparser decomposition and identify clustering structure in the original space.

​

Adaptive probabilistic principal component analysis, BNP@Neurips

Adaptive Latent Feature Sharing for Piecewise Linear Dimensionality Reduction, Journal of Machine Learning Research

Can you monitoring the occupancy of a room using a single motion sensor? 

occupant_estimation.png

Occupancy estimation systems are becoming ubiquitous part of smart buildings. However, majority of them rely on privacy invading camera sensors or multiple motion sensors placed at all exits. We show that approximate occupancy estimation can be easily achieved using a novel affordable setup with a single motion sensor.

 

Published work:

Single-cell RNA sequence clustering 

fgene-10-01253-g003.jpg

In this study, we compare a large set of popular single-cell clustering algorithms and pre-processing strategies. The study demonstrates that the clustering performance of single-cell data often depends on just as much on appropriate preprocessing, as it does on the specific clustering algorithm. We also develop a novel Bayesian nonparametric method for single-cell clustering with the ensMAPDP package release coming soon.

  • ​Benchmark and parameter sensitivity analysis of scRNAseq clustering methods, Frontiers in Genetics​​

© 2025 by Yordan P. Raykov

    bottom of page