TY - JOUR
T1 - Incorporating deep learning and multi-omics autoencoding for analysis of lung adenocarcinoma prognostication
AU - Lee, Tzong Yi
AU - Huang, Kai Yao
AU - Chuang, Cheng Hsiang
AU - Lee, Cheng Yang
AU - Chang, Tzu Hao
N1 - Publisher Copyright:
© 2020 Elsevier Ltd
PY - 2020/8
Y1 - 2020/8
N2 - Lung cancer is the most occurring cancer type, and its mortality rate is also the highest, among them lung adenocarcinoma (LUAD) accounts for about 40 % of lung cancer. There is an urgent need to develop a prognosis prediction model for lung adenocarcinoma. Previous LUAD prognosis studies only took single-omics data, such as mRNA or miRNA, into consideration. To this end, we proposed a deep learning-based autoencoding approach for combination of four-omics data, mRNA, miRNA, DNA methylation and copy number variations, to construct an autoencoder model, which learned representative features to differentiate the two optimal patient subgroups with a significant difference in survival (P = 4.08e-09) and good consistency index (C-index = 0.65). The multi-omics model was validated though four independent datasets, i.e. GSE81089 for mRNA (n = 198, P = 0.0083), GSE63805 for miRNA (n = 32, P = 0.018), GSE63384 for DNA methylation (n = 35, P = 0.009), and TCGA independent samples for copy number variations (n = 94, P = 0.0052). Finally, a functional analysis was performed on two survival subgroups to discover genes involved in biological processes and pathways. This is the first study incorporating deep autoencoding and four-omics data to construct a robust survival prediction model, and results show the approach is useful at predicting LUAD prognostication.
AB - Lung cancer is the most occurring cancer type, and its mortality rate is also the highest, among them lung adenocarcinoma (LUAD) accounts for about 40 % of lung cancer. There is an urgent need to develop a prognosis prediction model for lung adenocarcinoma. Previous LUAD prognosis studies only took single-omics data, such as mRNA or miRNA, into consideration. To this end, we proposed a deep learning-based autoencoding approach for combination of four-omics data, mRNA, miRNA, DNA methylation and copy number variations, to construct an autoencoder model, which learned representative features to differentiate the two optimal patient subgroups with a significant difference in survival (P = 4.08e-09) and good consistency index (C-index = 0.65). The multi-omics model was validated though four independent datasets, i.e. GSE81089 for mRNA (n = 198, P = 0.0083), GSE63805 for miRNA (n = 32, P = 0.018), GSE63384 for DNA methylation (n = 35, P = 0.009), and TCGA independent samples for copy number variations (n = 94, P = 0.0052). Finally, a functional analysis was performed on two survival subgroups to discover genes involved in biological processes and pathways. This is the first study incorporating deep autoencoding and four-omics data to construct a robust survival prediction model, and results show the approach is useful at predicting LUAD prognostication.
KW - Autoencoder
KW - Deep learning
KW - Lung adenocarcinoma
KW - Machine learning
KW - Multi-omics
KW - Prognosis prediction
KW - Survival analysis
UR - http://www.scopus.com/inward/record.url?scp=85085737506&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85085737506&partnerID=8YFLogxK
U2 - 10.1016/j.compbiolchem.2020.107277
DO - 10.1016/j.compbiolchem.2020.107277
M3 - Article
AN - SCOPUS:85085737506
SN - 1476-9271
VL - 87
JO - Computational Biology and Chemistry
JF - Computational Biology and Chemistry
M1 - 107277
ER -