TY - JOUR
T1 - Leveraging federated learning for boosting data privacy and performance in IVF embryo selection
AU - Lee, Chun I.
AU - Tzeng, Chii Ruey
AU - Li, Monty
AU - Lai, Hsing Hua
AU - Chen, Chi Huang
AU - Huang, Yulun
AU - Chang, T. Arthur
AU - Chen, Chien Hong
AU - Huang, Chun Chia
AU - Lee, Maw Sheng
AU - Liu, Mark
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2024/7
Y1 - 2024/7
N2 - Purpose: To study the effectiveness of federated learning in in vitro fertilization on embryo evaluation tasks. Methods: This is a retrospective cohort analysis. Two datasets were used in this study. The ploidy status dataset consisted of 10,065 embryo records, 3760 treatments, and 2479 infertile couples from 5 hospitals. The clinical pregnancy dataset consisted of 4495 embryo records, 4495 treatments, and 3704 infertile couples from 4 hospitals. Federated learning and the gradient boosting decision tree algorithm were utilized for modeling. Results: On the ploidy status dataset, the areas under the receiver operating characteristic curves of our model trained with federated learning were 71.78%, 73.10%, 69.39%, 69.72%, and 73.46% for 5 hospitals respectively, showing an average increase of 2.5% compared to those of our model trained without federated learning. On the clinical pregnancy dataset, the areas under the receiver operating characteristic curves of our model trained with federated learning were 72.03%, 56.77%, 61.63%, and 58.58% for 4 hospitals respectively, showing an average increase of 3.08%. Conclusions: Federated learning can improve data privacy and data security and meanwhile improve the performance of embryo selection tasks by leveraging data from multiple sources. This study demonstrates the effectiveness of federated learning in embryo evaluation, and the results show the promise for future application.
AB - Purpose: To study the effectiveness of federated learning in in vitro fertilization on embryo evaluation tasks. Methods: This is a retrospective cohort analysis. Two datasets were used in this study. The ploidy status dataset consisted of 10,065 embryo records, 3760 treatments, and 2479 infertile couples from 5 hospitals. The clinical pregnancy dataset consisted of 4495 embryo records, 4495 treatments, and 3704 infertile couples from 4 hospitals. Federated learning and the gradient boosting decision tree algorithm were utilized for modeling. Results: On the ploidy status dataset, the areas under the receiver operating characteristic curves of our model trained with federated learning were 71.78%, 73.10%, 69.39%, 69.72%, and 73.46% for 5 hospitals respectively, showing an average increase of 2.5% compared to those of our model trained without federated learning. On the clinical pregnancy dataset, the areas under the receiver operating characteristic curves of our model trained with federated learning were 72.03%, 56.77%, 61.63%, and 58.58% for 4 hospitals respectively, showing an average increase of 3.08%. Conclusions: Federated learning can improve data privacy and data security and meanwhile improve the performance of embryo selection tasks by leveraging data from multiple sources. This study demonstrates the effectiveness of federated learning in embryo evaluation, and the results show the promise for future application.
KW - Clinical pregnancy
KW - Federated learning
KW - Gradient boosting decision tree
KW - In vitro fertilization
KW - Ploidy status
UR - http://www.scopus.com/inward/record.url?scp=85195407145&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85195407145&partnerID=8YFLogxK
U2 - 10.1007/s10815-024-03148-z
DO - 10.1007/s10815-024-03148-z
M3 - Article
C2 - 38834757
AN - SCOPUS:85195407145
SN - 1058-0468
VL - 41
SP - 1811
EP - 1820
JO - Journal of Assisted Reproduction and Genetics
JF - Journal of Assisted Reproduction and Genetics
IS - 7
ER -