TY - JOUR
T1 - Artificial intelligence⇓based prediction of lung cancer risk using nonimaging electronic medical records
T2 - Deep learning approach
AU - Yeh, Marvin Chia Han
AU - Wang, Yu Hsiang
AU - Yang, Hsuan Chia
AU - Bai, Kuan Jen
AU - Wang, Hsiao Han
AU - Li, Yu Chuan Jack
N1 - Funding Information:
This research was funded in part by Ministry of Education (MOE) grants MOE 109-6604-001-400 and DP2-110-21121-01-A-01.
Publisher Copyright:
©Hsuan-Chia Yang, Yu-Hsiang Wang, Hsuan-Chia Yang, Kuan-Jen Bai, Hsiao-Han Wang, Yu-Chuan (Jack) Li.
PY - 2021/8
Y1 - 2021/8
N2 - Background: Artificial intelligence approaches can integrate complex features and can be used to predict a patient’s risk of developing lung cancer, thereby decreasing the need for unnecessary and expensive diagnostic interventions. Objective: The aim of this study was to use electronic medical records to prescreen patients who are at risk of developing lung cancer. Methods: We randomly selected 2 million participants from the Taiwan National Health Insurance Research Database who received care between 1999 and 2013. We built a predictive lung cancer screening model with neural networks that were trained and validated using pre-2012 data, and we tested the model prospectively on post-2012 data. An age- and gender-matched subgroup that was 10 times larger than the original lung cancer group was used to assess the predictive power of the electronic medical record. Discrimination (area under the receiver operating characteristic curve [AUC]) and calibration analyses were performed. Results: The analysis included 11,617 patients with lung cancer and 1,423,154 control patients. The model achieved AUCs of 0.90 for the overall population and 0.87 in patients ≥55 years of age. The AUC in the matched subgroup was 0.82. The positive predictive value was highest (14.3%) among people aged ≥55 years with a pre-existing history of lung disease. Conclusions: Our model achieved excellent performance in predicting lung cancer within 1 year and has potential to be deployed for digital patient screening. Convolution neural networks facilitate the effective use of EMRs to identify individuals at high risk for developing lung cancer.
AB - Background: Artificial intelligence approaches can integrate complex features and can be used to predict a patient’s risk of developing lung cancer, thereby decreasing the need for unnecessary and expensive diagnostic interventions. Objective: The aim of this study was to use electronic medical records to prescreen patients who are at risk of developing lung cancer. Methods: We randomly selected 2 million participants from the Taiwan National Health Insurance Research Database who received care between 1999 and 2013. We built a predictive lung cancer screening model with neural networks that were trained and validated using pre-2012 data, and we tested the model prospectively on post-2012 data. An age- and gender-matched subgroup that was 10 times larger than the original lung cancer group was used to assess the predictive power of the electronic medical record. Discrimination (area under the receiver operating characteristic curve [AUC]) and calibration analyses were performed. Results: The analysis included 11,617 patients with lung cancer and 1,423,154 control patients. The model achieved AUCs of 0.90 for the overall population and 0.87 in patients ≥55 years of age. The AUC in the matched subgroup was 0.82. The positive predictive value was highest (14.3%) among people aged ≥55 years with a pre-existing history of lung disease. Conclusions: Our model achieved excellent performance in predicting lung cancer within 1 year and has potential to be deployed for digital patient screening. Convolution neural networks facilitate the effective use of EMRs to identify individuals at high risk for developing lung cancer.
KW - Artificial intelligence
KW - Electronic medical record
KW - Lung cancer screening
UR - http://www.scopus.com/inward/record.url?scp=85112109663&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112109663&partnerID=8YFLogxK
U2 - 10.2196/26256
DO - 10.2196/26256
M3 - Review article
C2 - 34342588
AN - SCOPUS:85112109663
SN - 1439-4456
VL - 23
JO - Journal of Medical Internet Research
JF - Journal of Medical Internet Research
IS - 8
M1 - e26256
ER -