Improving the use of mortality data in public health: A comparison of garbage code redistribution models

Ta Chou Ng, Wei Cheng Lo, Chu Chang Ku, Tsung Hsueh Lu, Hsien Ho Lin

研究成果: 雜誌貢獻文章同行評審

3 引文 斯高帕斯(Scopus)


Objectives: To describe and compare 3 garbage code (GC) redistribution models: naïve Bayes classifier (NB), coarsened exact matching (CEM), and multinomial logistic regression (MLR). Methods: We analyzed Taiwan Vital Registration data (2008-2016) using a 2-step approach. First, we used non-GC death records to evaluate 3 different prediction models (NB, CEM, and MLR), incorporating individual-level information on multiple causes of death (MCDs) and demographic characteristics. Second, we applied the best-performing model to GC death records to predict the underlying causes of death. We conducted additional simulation analyses for evaluating the predictive performance of models. Results: When we did not account for MCDs, all 3 models presented high average misclassification rates in GC assignment (NB, 81%; CEM, 86%; MLR, 81%). In the presence of MCD information, NB and MLR exhibited significant improvement in assignment accuracy (19% and 17% misclassification rate, respectively). Furthermore, CEM without a variable selection procedure resulted in a substantially higher misclassification rate (40%). Conclusions: Comparing potential GC redistribution approaches provides guidance for obtaining better estimates of cause-of-death distribution and highlights the significance of MCD information for vital registration system reform.

頁(從 - 到)222-229
期刊American Journal of Public Health
出版狀態已發佈 - 1月 1 2020

ASJC Scopus subject areas

  • 公共衛生、環境和職業健康


深入研究「Improving the use of mortality data in public health: A comparison of garbage code redistribution models」主題。共同形成了獨特的指紋。