Categories
Nevin Manimala Statistics

Predicting the diabetic foot in patients with type 2 diabetes mellitus based on machine learning

Biomed Eng Online. 2026 Jan 16. doi: 10.1186/s12938-025-01494-2. Online ahead of print.

ABSTRACT

BACKGROUND: Diabetic foot (DF) is a severe complication of type 2 diabetes mellitus (T2DM), contributing to significant morbidity and healthcare costs globally. Early prediction and intervention are critical for preventing amputations and improving patient outcomes. However, traditional statistical methods lack the capacity to handle high-dimensional clinical data and identify optimal predictive features. This study aimed to develop and validate machine learning models for DF risk prediction using feature selection strategies based on binary logistic regression and information theory.

METHODS: A retrospective cohort of 1,179 patients (95 DF cases, 1,084 T2DM controls) was analyzed using clinical and biochemical data from 2019 to 2025. Three data sets were constructed: (1) original features; (2) features selected via binary logistic regression (F1); and (3) features selected via information-theoretic global learning (F2). Six models-extreme learning machine (ELM), kernel extreme learning machine (KELM), and their variants trained on the three data sets-were evaluated using fivefold cross-validation. Performance metrics included area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and computational efficiency.

RESULTS: Age, blood-urea-nitrogen (BUN), homocysteine (Hcy), albumin (ALB), and fasting blood glucose (FBG) were identified as independent DF risk factors. The information theory-based KELM (IT-KELM) model achieved the highest AUC of 0.799 (sensitivity: 0.792 and specificity: 0.710) on F2, outperforming other models. Feature selection improved predictive accuracy while reducing computational time, with IT-KELM requiring 0.138 s for training and 0.0023 s for testing. The SHAP summary dot plot and bar chart revealed that the top five features contributing to the model were TP, RBC, ALB, BMI and HB.

CONCLUSIONS: Integrating information theory with KELM enhances DF risk prediction by optimizing feature subsets and leveraging nonlinear kernel mapping. The IT-KELM model demonstrates robust diagnostic performance and clinical feasibility for early DF screening. Future multi-center studies are needed to validate generalizability and refine model interpretability in real-world settings. This approach provides a cost-effective tool for precision medicine in diabetes care.

PMID:41546000 | DOI:10.1186/s12938-025-01494-2

By Nevin Manimala

Portfolio Website for Nevin Manimala