TY - GEN
T1 - Improving Allergenic Protein Prediction Using Physicochemical Features on Non-Redundant Sequences
AU - Signh, Sher
AU - Chiu, Jr Rou
AU - Sun, Kuei Ling
AU - Su, Emily Chia Yu
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/7
Y1 - 2019/7
N2 - Despite extensive studies in allergen prediction, current approaches still have room for performance improvement and suffer from the problem of lack of interpretable biological features. Thus, developments of allergen prediction method from sequences have become highly important to facilitate in silico vaccine design. In this study, we propose a systematic approach to predict allergenic proteins by incorporating sequence and physicochemical properties in machine learning algorithms. In addition, predictive performance of previous studies could be overestimated due to high redundancy in the data sets. Therefore, we reduce sequence redundancy in the data set and experiment results show that we achieve better predictive performance when compared with other approaches. This study can help discover new prophylactic and therapeutic vaccines for diseases. Moreover, we analyze immunological features that can provide valuable insights into immunotherapies of allergy and autoimmune diseases in translational bioinformatics.
AB - Despite extensive studies in allergen prediction, current approaches still have room for performance improvement and suffer from the problem of lack of interpretable biological features. Thus, developments of allergen prediction method from sequences have become highly important to facilitate in silico vaccine design. In this study, we propose a systematic approach to predict allergenic proteins by incorporating sequence and physicochemical properties in machine learning algorithms. In addition, predictive performance of previous studies could be overestimated due to high redundancy in the data sets. Therefore, we reduce sequence redundancy in the data set and experiment results show that we achieve better predictive performance when compared with other approaches. This study can help discover new prophylactic and therapeutic vaccines for diseases. Moreover, we analyze immunological features that can provide valuable insights into immunotherapies of allergy and autoimmune diseases in translational bioinformatics.
KW - Allergen prediction
KW - Machine learning algorithms
KW - Physicochemical features
KW - Sequence patterns
UR - http://www.scopus.com/inward/record.url?scp=85078540029&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078540029&partnerID=8YFLogxK
U2 - 10.1109/ICMLC48188.2019.8949197
DO - 10.1109/ICMLC48188.2019.8949197
M3 - Conference contribution
AN - SCOPUS:85078540029
T3 - Proceedings - International Conference on Machine Learning and Cybernetics
BT - Proceedings of 2019 International Conference on Machine Learning and Cybernetics, ICMLC 2019
PB - IEEE Computer Society
T2 - 18th International Conference on Machine Learning and Cybernetics, ICMLC 2019
Y2 - 7 July 2019 through 10 July 2019
ER -