Sci Rep. 2025 Dec 14. doi: 10.1038/s41598-025-28723-x. Online ahead of print.
ABSTRACT
Chest radiography (CXR is widely used for triage and follow-up of pulmonary disease, yet COVID-19 classification remains vulnerable to bias, label noise, and domain shift. We propose a multi-stage Bayesian deep learning framework that combines lung segmentation, segmentation-guided classification, calibrated ensembling, and uncertainty estimation to classify four classes (COVID-19, normal, viral pneumonia, bacterial pneumonia) and to grade COVID-19 severity. Models are trained and tested on 1,531 CXRs (100 COVID-19 images from 70 patients; 1,431 non-COVID images from ChestX-ray14) with patient-wise splits. The final ensemble achieves 98.33% test accuracy; COVID-19 sensitivity reaches 100% on this split. Robustness is quantified by stress-testing five image degradations (Gaussian noise, motion/defocus blur, JPEG compression, and downsampling), with macro AUC drops remaining small at moderate severities and larger under strong blur or heavy downsampling. Saliency and context-relevance analyses are used to identify spurious cues. The study is limited by dataset size and lack of external multi-site validation; a planned evaluation on COVIDx and BIMCV-COVID19 + is outlined.
PMID:41392298 | DOI:10.1038/s41598-025-28723-x