TY - JOUR
T1 - Deep Learning Algorithms for Detection of Diabetic Retinopathy in Retinal Fundus Photographs: A Systematic Review and Meta-Analysis
AU - Islam, Md Mohaimenul
AU - Yang, Hsuan-Chia
AU - Poly, Tahmina Nasrin
AU - Jian, Wen-Shan
AU - Li, Yu-Chuan (Jack)
PY - 2020/7
Y1 - 2020/7
N2 - Background : Diabetic retinopathy (DR) is one of the leading causes of blindness globally. Earlier detection and timely treatment of DR are desirable to reduce the incidence and progression of vision loss. Currently, deep learning (DL) approaches have offered better performance in detecting DR from retinal fundus images. We, therefore, performed a systematic review with a meta-analysis of relevant studies to quantify the performance of DL algorithms for detecting DR. Methods : A systematic literature search on EMBASE, PubMed, Google Scholar, Scopus was performed between January 1, 2000, and March 31, 2019. The search strategy was based on the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) reporting guidelines, and DL-based study design was mandatory for articles inclusion. Two independent authors screened abstracts and titles against inclusion and exclusion criteria. Data were extracted by two authors independently using a standard form and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool was used for the risk of bias and applicability assessment. Results : Twenty-three studies were included in the systematic review; 20 studies met inclusion criteria for the meta-analysis. The pooled area under the receiving operating curve (AUROC) of DR was 0.97 (95%CI: 0.95-0.98), sensitivity was 0.83 (95%CI: 0.83-0.83), and specificity was 0.92 (95%CI: 0.92-0.92). The positive- and negative-likelihood ratio were 14.11 (95%CI: 9.91-20.07), and 0.10 (95%CI: 0.07-0.16), respectively. Moreover, the diagnostic odds ratio for DL models was 136.83 (95%CI: 79.03-236.93). All the studies provided a DR-grading scale, a human grader (e.g. trained caregivers, ophthalmologists) as a reference standard. Conclusion : The findings of our study showed that DL algorithms had high sensitivity and specificity for detecting referable DR from retinal fundus photographs. Applying a DL-based automated tool of assessing DR from color fundus images could provide an alternative solution to reduce misdiagnosis and improve workflow. A DL-based automated tool offers substantial benefits to reduce screening costs, accessibility to healthcare and ameliorate earlier treatments.
AB - Background : Diabetic retinopathy (DR) is one of the leading causes of blindness globally. Earlier detection and timely treatment of DR are desirable to reduce the incidence and progression of vision loss. Currently, deep learning (DL) approaches have offered better performance in detecting DR from retinal fundus images. We, therefore, performed a systematic review with a meta-analysis of relevant studies to quantify the performance of DL algorithms for detecting DR. Methods : A systematic literature search on EMBASE, PubMed, Google Scholar, Scopus was performed between January 1, 2000, and March 31, 2019. The search strategy was based on the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) reporting guidelines, and DL-based study design was mandatory for articles inclusion. Two independent authors screened abstracts and titles against inclusion and exclusion criteria. Data were extracted by two authors independently using a standard form and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool was used for the risk of bias and applicability assessment. Results : Twenty-three studies were included in the systematic review; 20 studies met inclusion criteria for the meta-analysis. The pooled area under the receiving operating curve (AUROC) of DR was 0.97 (95%CI: 0.95-0.98), sensitivity was 0.83 (95%CI: 0.83-0.83), and specificity was 0.92 (95%CI: 0.92-0.92). The positive- and negative-likelihood ratio were 14.11 (95%CI: 9.91-20.07), and 0.10 (95%CI: 0.07-0.16), respectively. Moreover, the diagnostic odds ratio for DL models was 136.83 (95%CI: 79.03-236.93). All the studies provided a DR-grading scale, a human grader (e.g. trained caregivers, ophthalmologists) as a reference standard. Conclusion : The findings of our study showed that DL algorithms had high sensitivity and specificity for detecting referable DR from retinal fundus photographs. Applying a DL-based automated tool of assessing DR from color fundus images could provide an alternative solution to reduce misdiagnosis and improve workflow. A DL-based automated tool offers substantial benefits to reduce screening costs, accessibility to healthcare and ameliorate earlier treatments.
KW - Deep learning
KW - Diabetic
KW - Diabetic retinopathy
KW - Fundus photograph
KW - Retinopathy
UR - http://www.scopus.com/inward/record.url?scp=85079855206&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85079855206&partnerID=8YFLogxK
U2 - 10.1016/j.cmpb.2020.105320
DO - 10.1016/j.cmpb.2020.105320
M3 - Article
SN - 0169-2607
VL - 191
SP - 105320
JO - Computer Methods and Programs in Biomedicine
JF - Computer Methods and Programs in Biomedicine
M1 - 105320
ER -