TY - JOUR
T1 - Automatic Detection of Meniscus Tears Using Backbone Convolutional Neural Networks on Knee MRI
AU - Hung, Truong Nguyen Khanh
AU - Vy, Vu Pham Thao
AU - Tri, Nguyen Minh
AU - Hoang, Le Ngoc
AU - Tuan, Le Van
AU - Ho, Quang Thai
AU - Le, Nguyen Quoc Khanh
AU - Kang, Jiunn Horng
N1 - Funding Information:
This work was supported by the Ministry of Science and Technology, Taiwan (grant number MOST110‐2221‐E‐038‐001‐MY2).
Publisher Copyright:
© 2022 International Society for Magnetic Resonance in Medicine.
PY - 2023/3
Y1 - 2023/3
N2 - Background: Timely diagnosis of meniscus injuries is key for preventing knee joint dysfunction and improving patient outcomes because it decreases morbidity and facilitates treatment planning. Purpose: To train and evaluate a deep learning model for automated detection of meniscus tears on knee magnetic resonance imaging (MRI). Study type: Bicentric retrospective study. Subjects: In total, 584 knee MRI studies, divided among training (n = 234), testing (n = 200), and external validation (n = 150) data sets, were used in this study. The public data set MRNet was used as a second external validation data set to evaluate the performance of the model. Sequence: A 3 T, coronal, and sagittal images from T1-weighted proton density (PD) fast spin-echo (FSE) with fat saturation and T2-weighted FSE with fat saturation sequences. Assessment: The detection system for meniscus tear was based on the improved YOLOv4 model with Darknet-53 as the backbone. The performance of the model was also compared with that of three radiologists of varying levels of experience. The determination of the presence of a meniscus tear from surgery reports was used as the ground truth for the images. Statistical Tests: Sensitivity, specificity, prevalence, positive predictive value, negative predictive value, accuracy, and receiver operating characteristic curve were used to evaluate the performance of the detection model. Two-way analysis of variance, Wilcoxon signed-rank test, and Tukey's multiple tests were used to evaluate differences in performance between the model and radiologists. Results: The overall accuracies for detecting meniscus tears using our model on the internal testing, internal validation, and external validation data sets were 95.4%, 95.8%, and 78.8%, respectively. One radiologist had significantly lower performance than our model in detecting meniscal tears (accuracy: 0.9025 ± 0.093 vs. 0.9580 ± 0.025). Data Conclusion: The proposed model had high sensitivity, specificity, and accuracy for detecting meniscus tears on knee MRIs. Evidence Level: 3. Technical Efficacy: Stage 2.
AB - Background: Timely diagnosis of meniscus injuries is key for preventing knee joint dysfunction and improving patient outcomes because it decreases morbidity and facilitates treatment planning. Purpose: To train and evaluate a deep learning model for automated detection of meniscus tears on knee magnetic resonance imaging (MRI). Study type: Bicentric retrospective study. Subjects: In total, 584 knee MRI studies, divided among training (n = 234), testing (n = 200), and external validation (n = 150) data sets, were used in this study. The public data set MRNet was used as a second external validation data set to evaluate the performance of the model. Sequence: A 3 T, coronal, and sagittal images from T1-weighted proton density (PD) fast spin-echo (FSE) with fat saturation and T2-weighted FSE with fat saturation sequences. Assessment: The detection system for meniscus tear was based on the improved YOLOv4 model with Darknet-53 as the backbone. The performance of the model was also compared with that of three radiologists of varying levels of experience. The determination of the presence of a meniscus tear from surgery reports was used as the ground truth for the images. Statistical Tests: Sensitivity, specificity, prevalence, positive predictive value, negative predictive value, accuracy, and receiver operating characteristic curve were used to evaluate the performance of the detection model. Two-way analysis of variance, Wilcoxon signed-rank test, and Tukey's multiple tests were used to evaluate differences in performance between the model and radiologists. Results: The overall accuracies for detecting meniscus tears using our model on the internal testing, internal validation, and external validation data sets were 95.4%, 95.8%, and 78.8%, respectively. One radiologist had significantly lower performance than our model in detecting meniscal tears (accuracy: 0.9025 ± 0.093 vs. 0.9580 ± 0.025). Data Conclusion: The proposed model had high sensitivity, specificity, and accuracy for detecting meniscus tears on knee MRIs. Evidence Level: 3. Technical Efficacy: Stage 2.
KW - Darknet53
KW - knee MR image
KW - meniscus tear
KW - object detection
KW - YOLOv4
UR - http://www.scopus.com/inward/record.url?scp=85131035757&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85131035757&partnerID=8YFLogxK
U2 - 10.1002/jmri.28284
DO - 10.1002/jmri.28284
M3 - Article
AN - SCOPUS:85131035757
SN - 1053-1807
VL - 57
SP - 740
EP - 749
JO - Journal of Magnetic Resonance Imaging
JF - Journal of Magnetic Resonance Imaging
IS - 3
ER -