JMIR Med Inform. 2025 Jul 4;13:e62710. doi: 10.2196/62710.
ABSTRACT
BACKGROUND: Concerns have been expressed about the abundance of new clinical prediction models (CPMs) proposed in the literature. However, the extent of this proliferation in prediction research remains unclear.
OBJECTIVE: This study aimed to estimate the total and annual number of CPM development-related publications available across all medical fields.
METHODS: Using a validated search strategy, we conducted a systematic search of literature for prediction model studies published in Pubmed and Embase between 1995 and the end of 2020. By taking random samples for each year, we identified eligible studies that developed a multivariable model (ie, diagnostic or prognostic) for individual-level prediction of a health outcome across all medical fields. Exclusion criteria included development of models with a single predictor, studies not involving humans, methodological studies, conference abstracts, articles with unavailable full text, and those not available in English. We estimated the total and annual number of published regression-based multivariable CPM development articles, based on the total number of publications, proportion of included articles, and the search sensitivity. Furthermore, we used an adjusted Poisson regression to extrapolate our results to the period 1950-2024. Additionally, we estimated the number of articles that developed CPMs using techniques other than regression (eg, machine learning).
RESULTS: From a random sample of 10,660 articles published between 1995 and 2020, 109 regression-based CPM development articles were included. We estimated that 82,772 (95% CI 65,313-100,231) CPM development articles using regression were published, with an acceleration in model development from 2010 onward. With the addition of articles that developed non-regression-based CPMs, the number increased to 147,714 (95% CI 125,201-170,226). After extrapolation to the years 1950-2024, the number of articles increased to 156,673 and 248,431 for regression-based models and total CPMs, respectively.
CONCLUSIONS: Based on a representative sample of publications from the literature, we estimated that nearly 250,000 articles reporting the development of CPMs across all medical fields were published until 2024. CPM development-related publications continue to increase in number. To prevent research waste and close the gap between research and clinical practice, focus should shift away from developing new CPMs to facilitating model validation and impact assessment of the plethora of existing CPMs. Limitations of this study include restriction of search to articles available in English and development of the validated search strategy prior to the popularity of artificial intelligence and machine learning models.
PMID:40614260 | DOI:10.2196/62710