TY - JOUR
T1 - Development and validation of a machine learning model for predicting drug-drug interactions with oral diabetes medications
AU - Kha, Quang Hien
AU - Nguyen, Ngan Thi Kim
AU - Le, Nguyen Quoc Khanh
AU - Kang, Jiunn Horng
N1 - Publisher Copyright:
© 2024 Elsevier Inc.
PY - 2024/12
Y1 - 2024/12
N2 - Diabetes management is often complicated by comorbidities, requiring complex medication regimens that increase the risk of drug-drug interactions (DDIs), potentially compromising treatment outcomes or causing toxicity. Although machine learning (ML) models have made strides in DDI prediction, existing approaches lack specificity for oral diabetes medications and face challenges in interpretability. To address these limitations, we propose a novel ML-based framework utilizing the Simplified Molecular Input Line Entry System (SMILES) to encode structural information of oral diabetes drugs. Using this representation, we developed an XGBoost model, selecting molecular features through LASSO. Our dataset, sourced from DrugBank, included 42 oral diabetes drugs and 1,884 interacting drugs, divided into training, validation, and testing sets. The model identified 606 optimal features, achieving an F1-score of 0.8182. SHAP analysis was employed for feature interpretation, enhancing model transparency and clinical relevance. By predicting adverse DDIs, our model offers a valuable tool for clinical decision-making, aiding safer prescription practices. The 606 critical features provide insights into atomic-level interactions, linking computational predictions with biological experiments. We present a classification model specifically designed for predicting DDIs associated with oral diabetes medications, with an openly accessible web application to support diabetes management in multi-drug regimens and comorbidity settings.
AB - Diabetes management is often complicated by comorbidities, requiring complex medication regimens that increase the risk of drug-drug interactions (DDIs), potentially compromising treatment outcomes or causing toxicity. Although machine learning (ML) models have made strides in DDI prediction, existing approaches lack specificity for oral diabetes medications and face challenges in interpretability. To address these limitations, we propose a novel ML-based framework utilizing the Simplified Molecular Input Line Entry System (SMILES) to encode structural information of oral diabetes drugs. Using this representation, we developed an XGBoost model, selecting molecular features through LASSO. Our dataset, sourced from DrugBank, included 42 oral diabetes drugs and 1,884 interacting drugs, divided into training, validation, and testing sets. The model identified 606 optimal features, achieving an F1-score of 0.8182. SHAP analysis was employed for feature interpretation, enhancing model transparency and clinical relevance. By predicting adverse DDIs, our model offers a valuable tool for clinical decision-making, aiding safer prescription practices. The 606 critical features provide insights into atomic-level interactions, linking computational predictions with biological experiments. We present a classification model specifically designed for predicting DDIs associated with oral diabetes medications, with an openly accessible web application to support diabetes management in multi-drug regimens and comorbidity settings.
KW - Comorbidity Management
KW - Drug-Drug Interactions
KW - eXtreme Gradient Boosting
KW - Machine Learning
KW - Oral Diabetes Medications
KW - Simplified Molecular Input Line Entry System
UR - http://www.scopus.com/inward/record.url?scp=85208184904&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85208184904&partnerID=8YFLogxK
U2 - 10.1016/j.ymeth.2024.10.012
DO - 10.1016/j.ymeth.2024.10.012
M3 - Article
C2 - 39489198
AN - SCOPUS:85208184904
SN - 1046-2023
VL - 232
SP - 81
EP - 88
JO - Methods
JF - Methods
ER -