TY - JOUR
T1 - Screening the risk of obstructive sleep apnea by utilizing supervised learning techniques based on anthropometric features and snoring events
AU - Tsai, Cheng-Yu
AU - Liu, Wen-Te
AU - Hsu, Wen-Hua
AU - Majumdar, Arnab
AU - Stettler, Marc
AU - Lee, Kang-Yun
AU - Cheng, Wun-Hao
AU - Wu, Dean
AU - Lee, Hsin-Chien
AU - Kuan, Yi-Chun
AU - Wu, Cheng-Jung
AU - Lin, Yi-Chih
AU - Ho, Shu-Chuan
N1 - Publisher Copyright:
© The Author(s) 2023.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - Objectives: Obstructive sleep apnea (OSA) is typically diagnosed by polysomnography (PSG). However, PSG is time-consuming and has some clinical limitations. This study thus aimed to establish machine learning models to screen for the risk of having moderate-to-severe and severe OSA based on easily acquired features. Methods: We collected PSG data on 3529 patients from Taiwan and further derived the number of snoring events. Their baseline characteristics and anthropometric measures were obtained, and correlations among the collected variables were investigated. Next, six common supervised machine learning techniques were utilized, including random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbor (kNN), support vector machine (SVM), logistic regression (LR), and naïve Bayes (NB). First, data were independently separated into a training and validation dataset (80%) and a test dataset (20%). The approach with the highest accuracy in the training and validation phase was employed to classify the test dataset. Next, feature importance was investigated by calculating the Shapley value of every factor, which represented the impact on OSA risk screening. Results: The RF produced the highest accuracy (of >70%) in the training and validation phase in screening for both OSA severities. Hence, we employed the RF to classify the test dataset, and results showed a 79.32% accuracy for moderate-to-severe OSA and 74.37% accuracy for severe OSA. Snoring events and the visceral fat level were the most and second most essential features of screening for OSA risk. Conclusions: The established model can be considered for screening for the risk of having moderate-to-severe or severe OSA.
AB - Objectives: Obstructive sleep apnea (OSA) is typically diagnosed by polysomnography (PSG). However, PSG is time-consuming and has some clinical limitations. This study thus aimed to establish machine learning models to screen for the risk of having moderate-to-severe and severe OSA based on easily acquired features. Methods: We collected PSG data on 3529 patients from Taiwan and further derived the number of snoring events. Their baseline characteristics and anthropometric measures were obtained, and correlations among the collected variables were investigated. Next, six common supervised machine learning techniques were utilized, including random forest (RF), extreme gradient boosting (XGBoost), k-nearest neighbor (kNN), support vector machine (SVM), logistic regression (LR), and naïve Bayes (NB). First, data were independently separated into a training and validation dataset (80%) and a test dataset (20%). The approach with the highest accuracy in the training and validation phase was employed to classify the test dataset. Next, feature importance was investigated by calculating the Shapley value of every factor, which represented the impact on OSA risk screening. Results: The RF produced the highest accuracy (of >70%) in the training and validation phase in screening for both OSA severities. Hence, we employed the RF to classify the test dataset, and results showed a 79.32% accuracy for moderate-to-severe OSA and 74.37% accuracy for severe OSA. Snoring events and the visceral fat level were the most and second most essential features of screening for OSA risk. Conclusions: The established model can be considered for screening for the risk of having moderate-to-severe or severe OSA.
KW - Obstructive sleep apnea
KW - Shapley value
KW - anthropometric measure
KW - machine learning
KW - snoring event
UR - http://www.scopus.com/inward/record.url?scp=85150493539&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85150493539&partnerID=8YFLogxK
U2 - 10.1177/20552076231152751
DO - 10.1177/20552076231152751
M3 - Article
C2 - 36896329
SN - 2055-2076
VL - 9
SP - 20552076231152751
JO - Digital Health
JF - Digital Health
ER -