Using AI Algorithm to Establish the CVD Risk Assessment Model

Yin Chen Chen, Hsiu An Lee, Chien Yeh Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In Taiwan, diseases with cardiovascular include heart disease and cerebrovascular disease among the top ten causes of death. With the development of data mining in the medical field, it can be used to establish the risk prediction models of disease as a tool to assist physicians in decision-making in diagnosis. The purpose of this study is taking Taiwanese health examination data as an example, to identify the significant risk factors of cardiovascular disease and to establish the risk assessment model of cardiovascular disease using data mining. Using Chi-square test and Information Gain identify the correlation between various factors and cardiovascular diseases. Using eight algorithms such as decision tree, random forest, XGBoost, neural network, logistic regression, support vector machine, K nearest neighbor algorithm and voting algorithm to establish the risk assessment model of cardiovascular disease and using confusion matrix and AUC as model evaluation. Compare model performance with different factor combinations. Experiment result shows that it finds 22 questionnaires and biochemical variable risk factors affecting cardiovascular disease and 10 questionnaire factors. ANN and VOTE with 22 factors of Information Gain (threshold > 0.01) are the best models. Both models have the same accuracy (0.88) and AUC (0.90). The best model of questionnaire variable is ANN with the accuracy rate is 0.89, and the AUC is 0.91.

Original languageEnglish
Title of host publicationInnovative Computing - Proceedings of the 5th International Conference on Innovative Computing, IC 2022
EditorsYan Pei, Jia-Wei Chang, Jason C. Hung
PublisherSpringer Science and Business Media Deutschland GmbH
Pages156-166
Number of pages11
ISBN (Print)9789811941313
DOIs
Publication statusPublished - 2022
Event5th International Conference on Innovative Computing, IC 2022 - Guam, United States
Duration: Jan 19 2022Jan 21 2022

Publication series

NameLecture Notes in Electrical Engineering
Volume935 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference5th International Conference on Innovative Computing, IC 2022
Country/TerritoryUnited States
CityGuam
Period1/19/221/21/22

Keywords

  • Cardiovascular disease
  • Data mining
  • Feature selection
  • Risk assessment model

ASJC Scopus subject areas

  • Industrial and Manufacturing Engineering

Fingerprint

Dive into the research topics of 'Using AI Algorithm to Establish the CVD Risk Assessment Model'. Together they form a unique fingerprint.

Cite this