TY - JOUR
T1 - Machine Learning Approach for Chronic Kidney Disease Risk Prediction Combining Conventional Risk Factors and Novel Metabolic Indices
AU - Jallow, Amadou Wurry
AU - Bah, Adama N.S.
AU - Bah, Karamo
AU - Hsu, Chien Yeh
AU - Chu, Kuo Chung
N1 - Publisher Copyright:
© 2022 by the authors.
PY - 2022/12
Y1 - 2022/12
N2 - Patients at risk of chronic kidney disease (CKD) must be identified early and precisely in order to prevent complications, save lives, and limit expenditures for patients and health systems. This study aimed to develop a simple, high-precision machine learning model to identify individuals at risk of developing CKD in the near future, using a novel metabolic index with or without creatinine. This retrospective cohort study used data from the MJ medical record database collected between 2001 and 2015 in Taiwan. We used Cox hazard regression to identify potential predictors, including the novel metabolic index, for use as variables in the models. To develop a machine learning-based CKD risk model with fewer variables, we performed several experimental analyses to combine interacting variables into subsets. Those subsets were used to train three models, random forest, logistic regression, and XGBoost, with or without adding creatinine. The study included 12,189 participants, 20% with and 80% without CKD. The most important conventional predictors of CKD are age and gender. The novel metabolic index, TyG-Index, TG/HDL-ratio and VAI, had stronger predictive power than the conventional risk factors. Without including creatinine data, the XGBoost provided the best predictive performance. After adding creatinine, the performance of all the models was excellent, outperforming both conventional indicators and existing clinical algorithms for CKD. Using novel metabolic index in machine learning-based CKD risk prediction can accurately identify individuals at risk of diagnosis with CKD in the next year, with or without including creatinine.
AB - Patients at risk of chronic kidney disease (CKD) must be identified early and precisely in order to prevent complications, save lives, and limit expenditures for patients and health systems. This study aimed to develop a simple, high-precision machine learning model to identify individuals at risk of developing CKD in the near future, using a novel metabolic index with or without creatinine. This retrospective cohort study used data from the MJ medical record database collected between 2001 and 2015 in Taiwan. We used Cox hazard regression to identify potential predictors, including the novel metabolic index, for use as variables in the models. To develop a machine learning-based CKD risk model with fewer variables, we performed several experimental analyses to combine interacting variables into subsets. Those subsets were used to train three models, random forest, logistic regression, and XGBoost, with or without adding creatinine. The study included 12,189 participants, 20% with and 80% without CKD. The most important conventional predictors of CKD are age and gender. The novel metabolic index, TyG-Index, TG/HDL-ratio and VAI, had stronger predictive power than the conventional risk factors. Without including creatinine data, the XGBoost provided the best predictive performance. After adding creatinine, the performance of all the models was excellent, outperforming both conventional indicators and existing clinical algorithms for CKD. Using novel metabolic index in machine learning-based CKD risk prediction can accurately identify individuals at risk of diagnosis with CKD in the next year, with or without including creatinine.
KW - chronic kidney disease (CKD)
KW - machine learning
KW - novel metabolic indices
KW - risk factors
UR - http://www.scopus.com/inward/record.url?scp=85143723226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85143723226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/record.url?scp=85143723226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85143723226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/record.url?scp=85143723226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85143723226&partnerID=8YFLogxK
U2 - 10.3390/app122312001
DO - 10.3390/app122312001
M3 - Article
AN - SCOPUS:85143723226
SN - 2076-3417
VL - 12
JO - Applied Sciences (Switzerland)
JF - Applied Sciences (Switzerland)
IS - 23
M1 - 12001
ER -