TY - JOUR
T1 - Optimizing Hyperparameter Tuning in Machine Learning to Improve the Predictive Performance of Cross-Species N6-Methyladenosine Sites
AU - Le, Nguyen Quoc Khanh
AU - Xu, Ling
N1 - Publisher Copyright:
© 2023 The Authors. Published by American Chemical Society
PY - 2023/10/24
Y1 - 2023/10/24
N2 - DNA N6-methyladenosine (6 mA) modification carries significant epigenetic information and plays a pivotal role in biological functions, thereby profoundly impacting human development. Precise and reliable detection of 6 mA sites is integral to understanding the mechanisms underpinning DNA modification. The present methods, primarily experimental, used to identify specific molecular sites are often time-intensive and costly. Consequently, the rise of computer-based methods aimed at identifying 6 mA sites provides a welcome alternative. Our research introduces a novel model to discern DNA 6 mA sites in cross-species genomes. This model, developed through machine learning, utilizes extracted sequence information. Hyperparameter tuning was employed to ascertain the most effective feature combination and model implementation, thereby garnering vital information from sequences. Our model demonstrated superior accuracy compared to the existing models when tested using five-fold cross-validation. Thus, our study substantiates the reliability and efficiency of our model as a valuable tool for supplementing experimental research.
AB - DNA N6-methyladenosine (6 mA) modification carries significant epigenetic information and plays a pivotal role in biological functions, thereby profoundly impacting human development. Precise and reliable detection of 6 mA sites is integral to understanding the mechanisms underpinning DNA modification. The present methods, primarily experimental, used to identify specific molecular sites are often time-intensive and costly. Consequently, the rise of computer-based methods aimed at identifying 6 mA sites provides a welcome alternative. Our research introduces a novel model to discern DNA 6 mA sites in cross-species genomes. This model, developed through machine learning, utilizes extracted sequence information. Hyperparameter tuning was employed to ascertain the most effective feature combination and model implementation, thereby garnering vital information from sequences. Our model demonstrated superior accuracy compared to the existing models when tested using five-fold cross-validation. Thus, our study substantiates the reliability and efficiency of our model as a valuable tool for supplementing experimental research.
UR - http://www.scopus.com/inward/record.url?scp=85176743689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85176743689&partnerID=8YFLogxK
U2 - 10.1021/acsomega.3c05074
DO - 10.1021/acsomega.3c05074
M3 - Article
AN - SCOPUS:85176743689
SN - 2470-1343
VL - 8
SP - 39420
EP - 39426
JO - ACS Omega
JF - ACS Omega
IS - 42
ER -