Xi Bao Yu Fen Zi Mian Yi Xue Za Zhi. 2025 Apr;41(4):339-347.
ABSTRACT
Objective To mine and analyze routine blood test data from children with allergic rhinitis (AR), identify routine blood parameters associated with childhood AR, establish an effective diagnostic model, and evaluate its performance. Methods This was a retrospective study of clinical cases. The AR group comprised 1110 children diagnosed with AR at the First Affiliated Hospital of Air Force Medical University between December 12, 2020 and December 12, 2021; the control group comprised 1109 children with no history of AR or other allergic diseases who underwent routine physical examinations during the same period. Age, sex, and routine blood test results were collected for all subjects. Routine blood test indicators were compared between children with AR and healthy children using comprehensive intelligent baseline analysis, and indicators with P≥0.05 were excluded; the remaining variables were screened by Lasso regression. Binary logistic regression was then used to evaluate the influence of the selected routine blood indicators on the outcome. Five machine learning algorithms were used to establish diagnostic models: extreme gradient boosting (XGBoost), logistic regression (LR), light gradient boosting machine (LightGBM), random forest (RF), and adaptive boosting (AdaBoost). Receiver operating characteristic (ROC) curves were used to select the optimal model. The best-performing algorithm, LightGBM, was used to build an online patient risk assessment tool for clinical application.
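The ROC-based model selection described in the Methods can be illustrated with a minimal sketch. It assumes that each candidate model outputs a risk score per child and that the model with the highest area under the ROC curve (AUC) on held-out data is chosen; the abstract does not state the exact selection procedure. The function names (roc_auc, best_model), the model names, and the toy data are hypothetical and are not taken from the study.

```python
from typing import Dict, Sequence, Tuple


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via the rank (Mann-Whitney U) formulation.

    labels: 1 for a case (e.g. AR), 0 for a control.
    scores: model-predicted risk; higher means more likely a case.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one control")
    # Count case/control pairs that are ranked correctly; ties count as half.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def best_model(
    labels: Sequence[int], scores_by_model: Dict[str, Sequence[float]]
) -> Tuple[str, float]:
    """Return (name, auc) of the model with the highest AUC."""
    aucs = {name: roc_auc(labels, s) for name, s in scores_by_model.items()}
    name = max(aucs, key=aucs.get)
    return name, aucs[name]


# Toy validation data, invented purely for illustration (not study data).
y_val = [1, 1, 1, 0, 0, 0]
toy_scores = {
    "model_a": [0.9, 0.8, 0.4, 0.5, 0.3, 0.2],
    "model_b": [0.6, 0.4, 0.7, 0.5, 0.6, 0.3],
}
print(best_model(y_val, toy_scores))
```

The pairwise formulation is quadratic in sample size and is used here only for readability; for cohorts of the size reported, a library routine such as scikit-learn's roc_auc_score would be the practical choice.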
Results Statistically significant differences between the AR group and the control group were observed in age and in the following routine blood test indicators: mean corpuscular hemoglobin concentration (MCHC), hemoglobin (HGB), absolute basophil count (BASO), absolute eosinophil count (EOS), platelet-large cell ratio (P-LCR), mean platelet volume (MPV), platelet distribution width (PDW), platelet count (PLT), and the absolute counts of neutrophils (W-LCC), monocytes (W-MCC), and lymphocytes (W-SCC). Lasso regression identified these variables as important predictors, and binary logistic regression further showed that they had a significant influence on the outcome. The best-performing machine learning algorithm, LightGBM, was used to establish a multi-indicator joint detection model. The model showed robust predictive performance, with AUC values of 0.8512 in the training set and 0.8103 in the internal validation set. Conclusion The identified routine blood parameters can serve as potential biomarkers for early diagnosis and risk assessment of AR, which can improve the accuracy and efficiency of diagnosis. The established model provides a scientific basis for more accurate diagnostic tools and personalized prevention strategies. Future studies should prospectively validate these findings and explore their applicability to other related diseases.
PMID:40260567