TY - GEN
T1 - Enhanced Prediction of mRNA Subcellular Localization Using a Novel Ensemble Learning and Hybrid Approach
AU - Nguyen, Thi Tuyen
AU - Nguyen, Van Nui
AU - Tran, Thi Xuan
AU - Le, Nguyen Quoc Khanh
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - Unraveling the subcellular localization of mRNA is an imperative aspect in the realm of biotechnology. This resolution can illuminate the inner workings of genetic regulatory mechanisms, gene expression modalities, and the evolution of cellular physiological and developmental processes. However, the experimental delineation of mRNA subcellular localization imposes significant temporal and resource commitments. Despite the development of multiple algorithms and predictive models for mRNA subcellular localization, their performance indexes have not been markedly high. In this paper, we introduce a novel hybrid approach to categorize mRNA into five distinct subcellular locales, including the cytoplasm, endoplasmic reticulum, extracellular region, mitochondria, and nucleus. Our model exploits the strengths of ensemble learning with a hybrid methodology, incorporating multiple biologically pertinent features extracted from the input sequencing data. Additionally, the model dynamically adjusts the weightages of functions and the minority class, through the modulation of the weight ratio of disparate models during their contribution to the principal model. Overall, our model delivers promising results, with an average accuracy of 0.89 in an independent dataset for the classification of mRNA subcellular localizations into five subclasses. This displays a significant performance elevation in contrast to preceding algorithms, particularly in instances where the classes are adequately sampled.
AB - Unraveling the subcellular localization of mRNA is an imperative aspect in the realm of biotechnology. This resolution can illuminate the inner workings of genetic regulatory mechanisms, gene expression modalities, and the evolution of cellular physiological and developmental processes. However, the experimental delineation of mRNA subcellular localization imposes significant temporal and resource commitments. Despite the development of multiple algorithms and predictive models for mRNA subcellular localization, their performance indexes have not been markedly high. In this paper, we introduce a novel hybrid approach to categorize mRNA into five distinct subcellular locales, including the cytoplasm, endoplasmic reticulum, extracellular region, mitochondria, and nucleus. Our model exploits the strengths of ensemble learning with a hybrid methodology, incorporating multiple biologically pertinent features extracted from the input sequencing data. Additionally, the model dynamically adjusts the weightages of functions and the minority class, through the modulation of the weight ratio of disparate models during their contribution to the principal model. Overall, our model delivers promising results, with an average accuracy of 0.89 in an independent dataset for the classification of mRNA subcellular localizations into five subclasses. This displays a significant performance elevation in contrast to preceding algorithms, particularly in instances where the classes are adequately sampled.
KW - Bioinformatics
KW - Ensemble learning
KW - Hybird
KW - Machine learning
KW - mRNA subcellular localization
KW - Sequence analysis
UR - http://www.scopus.com/inward/record.url?scp=85180157164&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85180157164&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-49529-8_7
DO - 10.1007/978-3-031-49529-8_7
M3 - Conference contribution
AN - SCOPUS:85180157164
SN - 9783031495281
T3 - Lecture Notes in Networks and Systems
SP - 60
EP - 68
BT - Advances in Information and Communication Technology - Proceedings of the 2nd International Conference ICTA 2023
A2 - Nghia, Phung Trung
A2 - Thai, Vu Duc
A2 - Thuy, Nguyen Thanh
A2 - Son, Le Hoang
A2 - Huynh, Van-Nam
PB - Springer Science and Business Media Deutschland GmbH
T2 - 2nd International Conference on Advances in Information and Communication Technology, ICTA 2023
Y2 - 13 December 2023 through 14 December 2023
ER -