Abstract
Considerable sequence data are produced in genome annotation projects that relate to molecular levels, structural similarities, and molecular and biological functions. In structural genomics, the most essential task involves resolving protein structures efficiently with hardware or software, understanding these structures, and assigning their biological functions. Understanding the characteristics and functions of proteins enables the exploration of the molecular mechanisms of life. In this paper, we examine the problems of protein classification. Because they perform similar biological functions, proteins in the same family usually share similar structural characteristics. We employed this premise in designing a classification algorithm. In this algorithm, auxiliary graphs are used to represent proteins, with every amino acid in a protein to a vertex in a graph. Moreover, the links between amino acids correspond to the edges between the vertices. The proposed algorithm classifies proteins according to the similarities in their graphical structures. The proposed algorithm is efficient and accurate in distinguishing proteins from different families and outperformed related algorithms experimentally.
Original language | English |
---|---|
Pages (from-to) | 3135-3143 |
Number of pages | 9 |
Journal | IEEE/ACM Transactions on Computational Biology and Bioinformatics |
Volume | 19 |
Issue number | 6 |
DOIs | |
Publication status | Published - Nov 2022 |
Keywords
- Amino acids
- B-factors
- bioinformatic algorithms
- Biology
- Computer science
- protein classification
- protein structures
- Proteins
- Roads
- structural similarities
- Support vector machines
- Three-dimensional displays
ASJC Scopus subject areas
- Biotechnology
- Genetics
- Applied Mathematics