TY - JOUR
T1 - Radiomics-based machine learning model for efficiently classifying transcriptome subtypes in glioblastoma patients from MRI
AU - Le, Nguyen Quoc Khanh
AU - Hung, Truong Nguyen Khanh
AU - Do, Duyen Thi
AU - Lam, Luu Ho Thanh
AU - Dang, Luong Huu
AU - Huynh, Tuan Tu
N1 - Funding Information:
This work was supported by the Research Grant for Newly Hired Faculty, Taipei Medical University [grant number: TMU108-AE1-B26 ] and Higher Education Sprout Project, Ministry of Education, Taiwan [grant number: DP2-109-21121-01-A-06 ].
Publisher Copyright:
© 2021 Elsevier Ltd
PY - 2021/5
Y1 - 2021/5
N2 - Background: In the field of glioma, transcriptome subtypes have been considered as an important diagnostic and prognostic biomarker that may help improve the treatment efficacy. However, existing identification methods of transcriptome subtypes are limited due to the relatively long detection period, the unattainability of tumor specimens via biopsy or surgery, and the fleeting nature of intralesional heterogeneity. In search of a superior model over previous ones, this study evaluated the efficiency of eXtreme Gradient Boosting (XGBoost)-based radiomics model to classify transcriptome subtypes in glioblastoma patients. Methods: This retrospective study retrieved patients from TCGA-GBM and IvyGAP cohorts with pathologically diagnosed glioblastoma, and separated them into different transcriptome subtypes groups. GBM patients were then segmented into three different regions of MRI: enhancement of the tumor core (ET), non-enhancing portion of the tumor core (NET), and peritumoral edema (ED). We subsequently used handcrafted radiomics features (n = 704) from multimodality MRI and two-level feature selection techniques (Spearman correlation and F-score tests) in order to find the features that could be relevant. Results: After the feature selection approach, we identified 13 radiomics features that were the most meaningful ones that can be used to reach the optimal results. With these features, our XGBoost model reached the predictive accuracies of 70.9%, 73.3%, 88.4%, and 88.4% for classical, mesenchymal, neural, and proneural subtypes, respectively. Our model performance has been improved in comparison with the other models as well as previous works on the same dataset. Conclusion: The use of XGBoost and two-level feature selection analysis (Spearman correlation and F-score) could be expected as a potential combination for classifying transcriptome subtypes with high performance and might raise public attention for further research on radiomics-based GBM models.
AB - Background: In the field of glioma, transcriptome subtypes have been considered as an important diagnostic and prognostic biomarker that may help improve the treatment efficacy. However, existing identification methods of transcriptome subtypes are limited due to the relatively long detection period, the unattainability of tumor specimens via biopsy or surgery, and the fleeting nature of intralesional heterogeneity. In search of a superior model over previous ones, this study evaluated the efficiency of eXtreme Gradient Boosting (XGBoost)-based radiomics model to classify transcriptome subtypes in glioblastoma patients. Methods: This retrospective study retrieved patients from TCGA-GBM and IvyGAP cohorts with pathologically diagnosed glioblastoma, and separated them into different transcriptome subtypes groups. GBM patients were then segmented into three different regions of MRI: enhancement of the tumor core (ET), non-enhancing portion of the tumor core (NET), and peritumoral edema (ED). We subsequently used handcrafted radiomics features (n = 704) from multimodality MRI and two-level feature selection techniques (Spearman correlation and F-score tests) in order to find the features that could be relevant. Results: After the feature selection approach, we identified 13 radiomics features that were the most meaningful ones that can be used to reach the optimal results. With these features, our XGBoost model reached the predictive accuracies of 70.9%, 73.3%, 88.4%, and 88.4% for classical, mesenchymal, neural, and proneural subtypes, respectively. Our model performance has been improved in comparison with the other models as well as previous works on the same dataset. Conclusion: The use of XGBoost and two-level feature selection analysis (Spearman correlation and F-score) could be expected as a potential combination for classifying transcriptome subtypes with high performance and might raise public attention for further research on radiomics-based GBM models.
KW - Artificial intelligence
KW - Glioblastoma
KW - Magnetic resonance imaging
KW - Neuroimaging
KW - Radiogenomics
KW - Radiomics biomarker
KW - Transcriptome subtypes
KW - XGBoost
UR - http://www.scopus.com/inward/record.url?scp=85102578982&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85102578982&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2021.104320
DO - 10.1016/j.compbiomed.2021.104320
M3 - Article
C2 - 33735760
AN - SCOPUS:85102578982
SN - 0010-4825
VL - 132
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
M1 - 104320
ER -