TY - JOUR
T1 - Identification of time-invariant biomarkers for non-genotoxic hepatocarcinogen assessment
AU - Huang, Shan Han
AU - Lin, Ying Chi
AU - Tung, Chun Wei
N1 - Funding Information:
This work was supported by the Ministry of Science and Technology of Taiwan (MOST107-2221-E-038-020-MY3), National Health Research Institutes (NHRI-109A1-EMCO-0319204), and Taipei Medical University (TMU108-AE1-B36).
Publisher Copyright:
© 2020 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2020/6/2
Y1 - 2020/6/2
N2 - Non-genotoxic hepatocarcinogens (NGHCs) can only be confirmed by 2-year rodent studies. Toxicogenomics (TGx) approaches using gene expression profiles from short-term animal studies could enable early assessment of NGHCs. However, high variance in the modulation of the genes had been noted among exposure styles and datasets. Expanding from our previous strategy in identifying consensus biomarkers in multiple experiments, we aimed to identify time-invariant biomarkers for NGHCs in short-term exposure styles and validate their applicability to long-term exposure styles. In this study, nine time-invariant biomarkers, namely A2m, Akr7a3, Aqp7, Ca3, Cdc2a, Cdkn3, Cyp2c11, Ntf3, and Sds, were identified from four large-scale microarray datasets. Machine learning techniques were subsequently employed to assess the prediction performance of the biomarkers. The biomarker set along with the Random Forest models gave the highest median area under the receiver operating characteristic curve (AUC) of 0.824 and a low interquartile range (IQR) variance of 0.036 based on a leave-one-out cross-validation. The application of the models to the external validation datasets achieved high AUC values of greater than or equal to 0.857. Enrichment analysis of the biomarkers inferred the involvement of chronic inflammatory diseases such as liver cirrhosis, fibrosis, and hepatocellular carcinoma in NGHCs. The time-invariant biomarkers provided a robust alternative for NGHC prediction.
AB - Non-genotoxic hepatocarcinogens (NGHCs) can only be confirmed by 2-year rodent studies. Toxicogenomics (TGx) approaches using gene expression profiles from short-term animal studies could enable early assessment of NGHCs. However, high variance in the modulation of the genes had been noted among exposure styles and datasets. Expanding from our previous strategy in identifying consensus biomarkers in multiple experiments, we aimed to identify time-invariant biomarkers for NGHCs in short-term exposure styles and validate their applicability to long-term exposure styles. In this study, nine time-invariant biomarkers, namely A2m, Akr7a3, Aqp7, Ca3, Cdc2a, Cdkn3, Cyp2c11, Ntf3, and Sds, were identified from four large-scale microarray datasets. Machine learning techniques were subsequently employed to assess the prediction performance of the biomarkers. The biomarker set along with the Random Forest models gave the highest median area under the receiver operating characteristic curve (AUC) of 0.824 and a low interquartile range (IQR) variance of 0.036 based on a leave-one-out cross-validation. The application of the models to the external validation datasets achieved high AUC values of greater than or equal to 0.857. Enrichment analysis of the biomarkers inferred the involvement of chronic inflammatory diseases such as liver cirrhosis, fibrosis, and hepatocellular carcinoma in NGHCs. The time-invariant biomarkers provided a robust alternative for NGHC prediction.
KW - Machine learning
KW - Non-genotoxic hepatocarcinogens
KW - Time-invariant biomarkers
KW - Toxicogenomics
UR - http://www.scopus.com/inward/record.url?scp=85086753917&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85086753917&partnerID=8YFLogxK
U2 - 10.3390/ijerph17124298
DO - 10.3390/ijerph17124298
M3 - Article
C2 - 32560183
AN - SCOPUS:85086753917
SN - 1661-7827
VL - 17
SP - 1
EP - 14
JO - International journal of environmental research and public health
JF - International journal of environmental research and public health
IS - 12
M1 - 4298
ER -