Nevin Manimala Statistics

The construction of machine learning-based predictive models for high-quality embryo formation in poor ovarian response patients with progestin-primed ovarian stimulation

Reprod Biol Endocrinol. 2024 Jul 10;22(1):78. doi: 10.1186/s12958-024-01251-5.


OBJECTIVE: To explore the optimal models for predicting the formation of high-quality embryos in Poor Ovarian Response (POR) Patients with Progestin-Primed Ovarian Stimulation (PPOS) using machine learning algorithms.

METHODS: A retrospective analysis was conducted on the clinical data of 4,216 POR cycles who underwent in vitro fertilization (IVF) / intracytoplasmic sperm injection (ICSI) at Sichuan Jinxin Xinan Women and Children’s Hospital from January 2015 to December 2021. Based on the presence of high-quality cleavage embryos 72 h post-fertilization, the samples were divided into the high-quality cleavage embryo group (N = 1950) and the non-high-quality cleavage embryo group (N = 2266). Additionally, based on whether high-quality blastocysts were observed following full blastocyst culture, the samples were categorized into the high-quality blastocyst group (N = 124) and the non-high-quality blastocyst group (N = 1800). The factors influencing the formation of high-quality embryos were analyzed using logistic regression. The predictive models based on machine learning methods were constructed and evaluated accordingly.

RESULTS: Differential analysis revealed that there are statistically significant differences in 14 factors between high-quality and non-high-quality cleavage embryos. Logistic regression analysis identified 14 factors as influential in forming high-quality cleavage embryos. In models excluding three variables (retrieved oocytes, MII oocytes, and 2PN fertilized oocytes), the XGBoost model performed slightly better (AUC = 0.672, 95% CI = 0.636-0.708). Conversely, in models including these three variables, the Random Forest model exhibited the best performance (AUC = 0.788, 95% CI = 0.759-0.818). In the analysis of high-quality blastocysts, significant differences were found in 17 factors. Logistic regression analysis indicated that 13 factors influence the formation of high-quality blastocysts. Including these variables in the predictive model, the XGBoost model showed the highest performance (AUC = 0.813, 95% CI = 0.741-0.884).

CONCLUSION: We developed a predictive model for the formation of high-quality embryos using machine learning methods for patients with POR undergoing treatment with the PPOS protocol. This model can help infertility patients better understand the likelihood of forming high-quality embryos following treatment and help clinicians better understand and predict treatment outcomes, thus facilitating more targeted and effective interventions.

PMID:38987797 | DOI:10.1186/s12958-024-01251-5

By Nevin Manimala

Portfolio Website for Nevin Manimala