Multi-Modal Sleep Stage Classification With Two-Stream Encoder-Decoder

Zhao Zhang, Bor Shyh Lin, Chih Wei Peng, Bor Shing Lin

研究成果: 雜誌貢獻文章同行評審

1 引文 斯高帕斯(Scopus)

摘要

Sleep staging serves as a fundamental assessment for sleep quality measurement and sleep disorder diagnosis. Although current deep learning approaches have successfully integrated multimodal sleep signals, enhancing the accuracy of automatic sleep staging, certain challenges remain, as follows: 1) optimizing the utilization of multi-modal information complementarity, 2) effectively extracting both long- and short-range temporal features of sleep information, and 3) addressing the class imbalance problem in sleep data. To address these challenges, this paper proposes a two-stream encode-decoder network, named TSEDSleepNet, which is inspired by the depth sensitive attention and automatic multi-modal fusion (DSA2F) framework. In TSEDSleepNet, a two-stream encoder is used to extract the multiscale features of electrooculogram (EOG) and electroencephalogram (EEG) signals. And a self-attention mechanism is utilized to fuse the multiscale features, generating multi-modal saliency features. Subsequently, the coarser-scale construction module (CSCM) is adopted to extract and construct multi-resolution features from the multiscale features and the salient features. Thereafter, a Transformer module is applied to capture both long- and short-range temporal features from the multi-resolution features. Finally, the long- and short-range temporal features are restored with low-layer details and mapped to the predicted classification results. Additionally, the Lovász loss function is applied to alleviate the class imbalance problem in sleep datasets. Our proposed method was tested on the Sleep-EDF-39 and Sleep-EDF-153 datasets, and it achieved classification accuracies of 88.9% and 85.2% and Macro-F1 scores of 84.8% and 79.7%, respectively, thus outperforming conventional traditional baseline models. These results highlight the efficacy of the proposed method in fusing multi-modal information. This method has potential for application as an adjunct tool for diagnosing sleep disorders.
原文英語
頁(從 - 到)2096-2105
頁數10
期刊IEEE Transactions on Neural Systems and Rehabilitation Engineering
32
DOIs
出版狀態已發佈 - 2024

ASJC Scopus subject areas

  • 內科學
  • 一般神經科學
  • 生物醫學工程
  • 復健

指紋

深入研究「Multi-Modal Sleep Stage Classification With Two-Stream Encoder-Decoder」主題。共同形成了獨特的指紋。

引用此