Introduction Assisted reproductive technology has been proposed for women with infertility. Moreover, in vitro fertilization (IVF) cycles are increasing. Factors contributing to successful pregnancy have been widely explored. In this study, we used machine learning algorithms to construct prediction models for clinical pregnancies in IVF. Materials and methods A total of 24,730 patients entered IVF and intracytoplasmic sperm injection cycles with clinical pregnancy outcomes at Taipei Medical University Hospital. Data used included patient characteristics and treatment. We used machine learning methods to develop prediction models for clinical pregnancy and explored how each variable affects the outcome of interest using partial dependence plots. Results Experimental results showed that the random forest algorithm outperforms logistic regression in terms of areas under the receiver operating characteristics curve. The ovarian stimulation protocol is the most important factor affecting pregnancy outcomes. Long and ultralong protocols have shown positive effects on clinical pregnancy among all protocols. Furthermore, total frozen and transferred embryos are positive for a clinical pregnancy, but female age and duration of infertility have negative effects on clinical pregnancy. Conclusion Our findings show the importance of variables and propensity of each variable by random forest algorithm for clinical pregnancy in the assisted reproductive technology cycle. This study provides a ranking of variables affecting clinical pregnancy and explores the effects of each treatment on successful pregnancy. Our study has the potential to help clinicians evaluate the success of IVF in patients.
ASJC Scopus subject areas