TY - JOUR
T1 - Predicting Hepatocellular Carcinoma with minimal features from electronic health records
T2 - Development of a deep learning model
AU - Liang, Chia Wei
AU - Yang, Hsuan Chia
AU - Islam, Md Mohaimenul
AU - Nguyen, Phung Anh Alex
AU - Feng, Yi Ting
AU - Hou, Ze Yu
AU - Huang, Chih Wei
AU - Poly, Tahmina Nasrin
AU - Li, Yu Chuan Jack
N1 - Funding Information:
This research is granted in part by the Ministry of Education (grant #109-6604-001-400 and #DP2-110-21121-01-A-0) and the Ministry of Science and Technology (grant # MOST 109-2222-E-038-002-MY2).
Publisher Copyright:
©Chia-Wei Liang, Hsuan-Chia Yang, Md Mohaimenul Islam, Phung Anh Alex Nguyen, Yi-Ting Feng, Ze Yu Hou, Chih-Wei Huang, Tahmina Nasrin Poly, Yu-Chuan Jack Li.
PY - 2021/10
Y1 - 2021/10
N2 - Background: Hepatocellular carcinoma (HCC), usually known as hepatoma, is the third leading cause of cancer mortality globally. Early detection of HCC helps in its treatment and increases survival rates. Objective: The aim of this study is to develop a deep learning model, using the trend and severity of each medical event from the electronic health record to accurately predict the patients who will be diagnosed with HCC in 1 year. Methods: Patients with HCC were screened out from the National Health Insurance Research Database of Taiwan between 1999 and 2013. To be included, the patients with HCC had to register as patients with cancer in the catastrophic illness file and had to be diagnosed as a patient with HCC in an inpatient admission. The control cases (non-HCC patients) were randomly sampled from the same database. We used age, gender, diagnosis code, drug code, and time information as the input variables of a convolution neural network model to predict those patients with HCC. We also inspected the highly weighted variables in the model and compared them to their odds ratio at HCC to understand how the predictive model works Results: We included 47,945 individuals, 9553 of whom were patients with HCC. The area under the receiver operating curve (AUROC) of the model for predicting HCC risk 1 year in advance was 0.94 (95% CI 0.937-0.943), with a sensitivity of 0.869 and a specificity 0.865. The AUROC for predicting HCC patients 7 days, 6 months, 1 year, 2 years, and 3 years early were 0.96, 0.94, 0.94, 0.91, and 0.91, respectively. Conclusions: The findings of this study show that the convolutional neural network model has immense potential to predict the risk of HCC 1 year in advance with minimal features available in the electronic health records.
AB - Background: Hepatocellular carcinoma (HCC), usually known as hepatoma, is the third leading cause of cancer mortality globally. Early detection of HCC helps in its treatment and increases survival rates. Objective: The aim of this study is to develop a deep learning model, using the trend and severity of each medical event from the electronic health record to accurately predict the patients who will be diagnosed with HCC in 1 year. Methods: Patients with HCC were screened out from the National Health Insurance Research Database of Taiwan between 1999 and 2013. To be included, the patients with HCC had to register as patients with cancer in the catastrophic illness file and had to be diagnosed as a patient with HCC in an inpatient admission. The control cases (non-HCC patients) were randomly sampled from the same database. We used age, gender, diagnosis code, drug code, and time information as the input variables of a convolution neural network model to predict those patients with HCC. We also inspected the highly weighted variables in the model and compared them to their odds ratio at HCC to understand how the predictive model works Results: We included 47,945 individuals, 9553 of whom were patients with HCC. The area under the receiver operating curve (AUROC) of the model for predicting HCC risk 1 year in advance was 0.94 (95% CI 0.937-0.943), with a sensitivity of 0.869 and a specificity 0.865. The AUROC for predicting HCC patients 7 days, 6 months, 1 year, 2 years, and 3 years early were 0.96, 0.94, 0.94, 0.91, and 0.91, respectively. Conclusions: The findings of this study show that the convolutional neural network model has immense potential to predict the risk of HCC 1 year in advance with minimal features available in the electronic health records.
KW - Convolution neural network
KW - Deep learning
KW - Deep learning model
KW - Hepatocellular carcinoma
KW - Hepatoma
KW - Risk prediction
UR - http://www.scopus.com/inward/record.url?scp=85118714655&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118714655&partnerID=8YFLogxK
U2 - 10.2196/19812
DO - 10.2196/19812
M3 - Article
C2 - 34709180
AN - SCOPUS:85118714655
SN - 2369-1999
VL - 7
SP - e19812
JO - JMIR Cancer
JF - JMIR Cancer
IS - 4
M1 - e19812
ER -