TY - JOUR
T1 - Prediction of FMN-binding residues with three-dimensional probability distributions of interacting atoms on protein surfaces
AU - Mahalingam, Rajasekaran
AU - Peng, Hung Pin
AU - Yang, An Suei
PY - 2014/2/21
Y1 - 2014/2/21
N2 - Flavin mono-nucleotide (FMN) is a cofactor which is involved in many biological reactions. The insights on protein-FMN interactions aid the protein functional annotation and also facilitate in drug design. In this study, we have established a new method, making use of an encoding scheme of the three-dimensional probability density maps that describe the distributions of 40 non-covalent interacting atom types around protein surfaces, to predict FMN-binding sites on protein surfaces. One machine learning model was trained for each of the 30 protein atom types to predict tentative FMN-binding sites on protein structures. The method's capability was evaluated by five-fold cross-validation on a dataset containing 81 non-redundant FMN-binding protein structures and further tested on independent datasets of 30 and 15 non-redundant protein structures respectively. These predictions achieved an accuracy of 0.94, 0.94 and 0.96 with the Matthews correlation coefficient (MCC) of 0.53, 0.53 and 0.65 respectively for the three protein structure sets. The prediction capability is superior to the existing method. This is the first structure-based approach that does not rely on evolutionary information for predicting FMN-interacting residues. The webserver for the prediction is available at http://ismblab.genomics.sinica.edu.tw/.
AB - Flavin mono-nucleotide (FMN) is a cofactor which is involved in many biological reactions. The insights on protein-FMN interactions aid the protein functional annotation and also facilitate in drug design. In this study, we have established a new method, making use of an encoding scheme of the three-dimensional probability density maps that describe the distributions of 40 non-covalent interacting atom types around protein surfaces, to predict FMN-binding sites on protein surfaces. One machine learning model was trained for each of the 30 protein atom types to predict tentative FMN-binding sites on protein structures. The method's capability was evaluated by five-fold cross-validation on a dataset containing 81 non-redundant FMN-binding protein structures and further tested on independent datasets of 30 and 15 non-redundant protein structures respectively. These predictions achieved an accuracy of 0.94, 0.94 and 0.96 with the Matthews correlation coefficient (MCC) of 0.53, 0.53 and 0.65 respectively for the three protein structure sets. The prediction capability is superior to the existing method. This is the first structure-based approach that does not rely on evolutionary information for predicting FMN-interacting residues. The webserver for the prediction is available at http://ismblab.genomics.sinica.edu.tw/.
KW - Computational method
KW - Drug discovery
KW - Functional annotation
KW - Machine learning
KW - Structure-based
UR - https://www.scopus.com/pages/publications/84892868277
UR - https://www.scopus.com/pages/publications/84892868277#tab=citedBy
U2 - 10.1016/j.jtbi.2013.10.020
DO - 10.1016/j.jtbi.2013.10.020
M3 - Article
C2 - 24211525
AN - SCOPUS:84892868277
SN - 0022-5193
VL - 343
SP - 154
EP - 161
JO - Journal of Theoretical Biology
JF - Journal of Theoretical Biology
ER -