Abstract
In Taiwan, diseases with cardiovascular include heart disease and cerebrovascular disease among the top ten causes of death. With the development of data mining in the medical field, it can be used to establish the risk prediction models of disease as a tool to assist physicians in decision-making in diagnosis. The purpose of this study is taking Taiwanese health examination data as an example, to identify the significant risk factors of cardiovascular disease and to establish the risk assessment model of cardiovascular disease using data mining. Using Chi-square test and Information Gain identify the correlation between various factors and cardiovascular diseases. Using eight algorithms such as decision tree, random forest, XGBoost, neural network, logistic regression, support vector machine, K nearest neighbor algorithm and voting algorithm to establish the risk assessment model of cardiovascular disease and using confusion matrix and AUC as model evaluation. Compare model performance with different factor combinations. Experiment result shows that it finds 22 questionnaires and biochemical variable risk factors affecting cardiovascular disease and 10 questionnaire factors. ANN and VOTE with 22 factors of Information Gain (threshold > 0.01) are the best models. Both models have the same accuracy (0.88) and AUC (0.90). The best model of questionnaire variable is ANN with the accuracy rate is 0.89, and the AUC is 0.91.
| Original language | English |
|---|---|
| Title of host publication | Innovative Computing - Proceedings of the 5th International Conference on Innovative Computing, IC 2022 |
| Editors | Yan Pei, Jia-Wei Chang, Jason C. Hung |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 156-166 |
| Number of pages | 11 |
| ISBN (Print) | 9789811941313 |
| DOIs | |
| Publication status | Published - 2022 |
| Externally published | Yes |
| Event | 5th International Conference on Innovative Computing, IC 2022 - Guam, United States Duration: Jan 19 2022 → Jan 21 2022 |
Publication series
| Name | Lecture Notes in Electrical Engineering |
|---|---|
| Volume | 935 LNEE |
| ISSN (Print) | 1876-1100 |
| ISSN (Electronic) | 1876-1119 |
Conference
| Conference | 5th International Conference on Innovative Computing, IC 2022 |
|---|---|
| Country/Territory | United States |
| City | Guam |
| Period | 1/19/22 → 1/21/22 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 3 Good Health and Well-being
Keywords
- Cardiovascular disease
- Data mining
- Feature selection
- Risk assessment model
ASJC Scopus subject areas
- Industrial and Manufacturing Engineering
Fingerprint
Dive into the research topics of 'Using AI Algorithm to Establish the CVD Risk Assessment Model'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS