Eat Weight Disord. 2026 Feb 9. doi: 10.1007/s40519-026-01821-z. Online ahead of print.
ABSTRACT
PURPOSE: The relationship between dietary nutrient intake and metabolically healthy obesity (MHO) remains poorly understood. This study aimed to construct machine learning models to predict MHO based on dietary nutrient profiles and to identify the most influential nutrients contributing to this phenotype.
METHODS: Data were derived from the U.S. National Health and Nutrition Examination Survey (NHANES) 2005-2018. Forty-five dietary nutrients, along with demographic and lifestyle variables, were included in two predictive frameworks: a dietary-only model and a complete model. Feature preprocessing involved assessing mixture effects, removing multicollinear variables, addressing class imbalance, and selecting important predictors. Six machine learning algorithms-random forest (RF), light gradient-boosting machine, k-nearest neighbor, Naive Bayes, support vector machine, and eXtreme Gradient Boosting (XGBoost)-were developed and benchmarked to compare performance. Model interpretability was examined using SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME).
RESULTS: A total of 8914 participants, including 475 classified as having MHO, were analyzed. The Random Forest model exhibited the best predictive performance in the complete model, achieving training and validation AUCs of 0.986 and 0.991, respectively. In contrast, XGBoost demonstrated superior performance in the dietary-only model, with AUCs of 0.971 and 0.988. SHAP and LIME analyses revealed that added vitamin B12, lycopene, caffeine, theobromine, and lutein/zeaxanthin were the strongest positive predictors in the complete model. When only dietary factors were considered, lycopene, lutein/zeaxanthin, magnesium, potassium, and selenium emerged as the most influential nutrients.
CONCLUSIONS: RF and XGBoost models provided the highest predictive accuracy for MHO using complete and dietary feature sets, respectively. The consistent findings from SHAP and LIME analyses emphasized lycopene and lutein/zeaxanthin as reliable and biologically relevant key predictors of metabolically healthy obesity.
LEVEL OF EVIDENCE: Level III, well-designed cohort or case-control analytic study.
PMID:41661492 | DOI:10.1007/s40519-026-01821-z