Categories
Nevin Manimala Statistics

Boosting Automated Sleep Staging Performance in Big Datasets using Population Sub-grouping

Sleep. 2021 May 26:zsab027. doi: 10.1093/sleep/zsab027. Online ahead of print.

ABSTRACT

Current approaches to automated sleep staging from the electroencephalogram (EEG) rely on constructing a large labeled training and test corpora by aggregating data from different individuals. However, many of the subjects in the training set may exhibit changes in the EEG that are very different from the subjects in the test set. Training an algorithm on such data without accounting for this diversity can cause underperformance. Moreover, test data may have unexpected sensor misplacement or different instrument noise and spectral responses. This work proposes a novel method to learn relevant individuals based on their similarities effectively. The proposed method embeds all training patients into a shared and robust feature space. Individuals that share strong statistical relationships and are similar based on their EEG signals are clustered in this feature space before being passed to a deep learning framework for classification. Using 994 patient EEGs from the 2018 Physionet Challenge (≈ 6,561 hours of recording), we demonstrate that the clustering approach significantly boosts performance compared to state-of-the-art deep learning approaches. The proposed method improves, on average, a precision score from 0.72 to 0.81, a sensitivity score from 0.74 to 0.82, and a Cohen’s Kappa coefficient from 0.64 to 0.75 under 10-fold cross-validation.

PMID:34038560 | DOI:10.1093/sleep/zsab027

By Nevin Manimala

Portfolio Website for Nevin Manimala