TY - JOUR
T1 - MCNN-ETC
T2 - Identifying electron transporters and their functional families by using multiple windows scanning techniques in convolutional neural networks with evolutionary information of protein sequences
AU - Ho, Quang Thai
AU - Le, Nguyen Quoc Khanh
AU - Ou, Yu Yen
N1 - Funding Information:
This work was partially supported by the Ministry of Science and Technology, Taiwan, R.O.C. under Grant no. MOST 109-2221-E-155-045 and MOST 110-2221-E-155-038-MY2.
Publisher Copyright:
© 2021 The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: [email protected].
PY - 2022/1/1
Y1 - 2022/1/1
N2 - In the past decade, convolutional neural networks (CNNs) have been used as powerful tools by scientists to solve visual data tasks. However, many efforts of convolutional neural networks in solving protein function prediction and extracting useful information from protein sequences have certain limitations. In this research, we propose a new method to improve the weaknesses of the previous method. mCNN-ETC is a deep learning model which can transform the protein evolutionary information into image-like data composed of 20 channels, which correspond to the 20 amino acids in the protein sequence. We constructed CNN layers with different scanning windows in parallel to enhance the useful pattern detection ability of the proposed model. Then we filtered specific patterns through the 1-max pooling layer before inputting them into the prediction layer. This research attempts to solve a basic problem in biology in terms of application: predicting electron transporters and classifying their corresponding complexes. The performance result reached an accuracy of 97.41%, which was nearly 6% higher than its predecessor. We have also published a web server on http://bio219.bioinfo.yzu.edu.tw, which can be used for research purposes free of charge.
AB - In the past decade, convolutional neural networks (CNNs) have been used as powerful tools by scientists to solve visual data tasks. However, many efforts of convolutional neural networks in solving protein function prediction and extracting useful information from protein sequences have certain limitations. In this research, we propose a new method to improve the weaknesses of the previous method. mCNN-ETC is a deep learning model which can transform the protein evolutionary information into image-like data composed of 20 channels, which correspond to the 20 amino acids in the protein sequence. We constructed CNN layers with different scanning windows in parallel to enhance the useful pattern detection ability of the proposed model. Then we filtered specific patterns through the 1-max pooling layer before inputting them into the prediction layer. This research attempts to solve a basic problem in biology in terms of application: predicting electron transporters and classifying their corresponding complexes. The performance result reached an accuracy of 97.41%, which was nearly 6% higher than its predecessor. We have also published a web server on http://bio219.bioinfo.yzu.edu.tw, which can be used for research purposes free of charge.
KW - convolutional neural network
KW - deep learning
KW - electron transport chain
KW - five complexes
KW - motif scanning
KW - position-specific scoring matrix
UR - http://www.scopus.com/inward/record.url?scp=85123814326&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123814326&partnerID=8YFLogxK
U2 - 10.1093/bib/bbab352
DO - 10.1093/bib/bbab352
M3 - Article
C2 - 34472594
AN - SCOPUS:85123814326
SN - 1467-5463
VL - 23
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 1
M1 - bbab352
ER -