Categories
Nevin Manimala Statistics

Leveraging multi-source data and teleconnection indices for enhanced runoff prediction using coupled deep learning models

Sci Rep. 2025 Apr 27;15(1):14732. doi: 10.1038/s41598-025-00115-1.

ABSTRACT

Accurate medium- to long-term runoff forecasting is crucial for flood control, drought resilience, water resources development, and ecological improvement. Traditional statistical methods struggle to utilize multifaceted variable information, leading to lower prediction accuracy. This study introduces two innovative coupled models-SRA-SVR and SRA-MLPR-to enhance runoff prediction by leveraging the strengths of statistical and deep learning approaches. Stepwise Regression Analysis (SRA) was employed to effectively handle high-dimensional data and multicollinearity, ensuring that only the most influential predictive variables were retained. Support Vector Regression (SVR) and Multi-Layer Perceptron Regression (MLPR) were chosen due to their strong adaptability in capturing nonlinear relationships and extracting latent hydrological patterns. The integration of these methods significantly improves prediction accuracy and model stability. By integrating 80 atmospheric circulation indices as teleconnection variables, the models tackle critical challenges such as high-dimensional data, multicollinearity, and nonlinear hydrological dynamics. The Yalong River Basin, characterized by complex hydrological processes and diverse climatic influences, serves as the case study for model validation. The results show that: (1) Compared to baseline single models, the SRA-MLPR model reduced RMSE (from 798.47 to 594.45) by 26% and MAPE (from 34.79 to 22.90%) by 34%, while achieving an NSE (from 0.67 to 0.76) improvement of 13%, particularly excelling in extreme runoff scenarios. (2) The inclusion of teleconnection indices not only enriched the predictive feature set but also improved model stability, with the SRA-MLPR demonstrating enhanced capability in capturing latent nonlinear relationships. (3) A one-month lag in atmospheric circulation indices was identified as the optimal predictor for basin-scale runoff, providing actionable insights into temporal runoff dynamics. (4) To enhance model interpretability, SHAP (SHapley Additive exPlanations) analysis was employed to quantify the contribution of atmospheric circulation indices to runoff predictions, revealing the dominant climate drivers and their nonlinear interactions. The results indicate that the Northern Hemisphere Polar Vortex and the East Asian Trough exert significant control over runoff dynamics, with their influence modulated by large-scale climate oscillations such as the North Atlantic Oscillation (NAO) and Pacific Decadal Oscillation (PDO). (5) The models’ scalability is validated through their modular design, allowing seamless adaptation to diverse hydrological contexts. Applications include improved flood forecasting, optimized reservoir operations, and adaptive water resource planning. Furthermore, the study demonstrates the potential of coupled models as generalizable tools for hydrological forecasting in basins with varying climatic and geographic conditions. This study highlights the potential of coupled models as robust and generalizable tools for hydrological forecasting across diverse climatic and geographic conditions. By integrating atmospheric circulation indices, the proposed models enhance runoff prediction accuracy and stability while offering valuable insights for flood prevention, drought mitigation, and adaptive water resource management. These methodological advancements bridge the gap between statistical and deep learning approaches, providing a scalable framework for accurate and interpretable hydrological, climatological, and environmental predictions. Given the escalating challenges brought about by climate change, the findings of this study make contributions to sustainable water management, interpretable decision-making support, and disaster preparedness at a global level.

PMID:40289219 | DOI:10.1038/s41598-025-00115-1

Categories
Nevin Manimala Statistics

Magnetic resonance imaging for multi-factorial age estimation using Bayes’ rule: a validation study in two independent samples

Int J Legal Med. 2025 Apr 28. doi: 10.1007/s00414-025-03499-x. Online ahead of print.

ABSTRACT

OBJECTIVES: Multi-factorial age estimation (MFA) models have been developed based on Bayes’ rule, combining MRI data of the third molars (T), left wrist (W) and/or both clavicles (C). Internal cross-validated performance was reported, but external validation is needed before bringing the approach into practice. This study aimed to validate these MFA models in two independent samples.

METHODS: In the Ghent sample, W + C MRI was prospectively conducted in 108 healthy Caucasian volunteers (52 males, 56 females) aged 16 to 21 years. In the Graz sample, T + W + C MRI was prospectively conducted in 335 healthy Caucasian males aged 13 to 24 years. Development was staged and checked for intra-observer reliability, and age estimation performances were tested.

RESULTS: Staging clavicles was most prone to intra-observer variability. Applying the W + C model to Ghent males rendered a mean absolute error of 1.55 years, a root mean square error of 1.90 years, 70.6% correctly categorised adults and 94.4% correctly categorised minors. In females, the results were 1.49 years, 1.83 years, 92.1% and 66.7%, respectively. Regarding the Graz sample, the W + C results were 1.66 years, 2.08 years, 94.7% and 80.7%, respectively. For the T + W + C model, the results were 1.41 years, 1.80 years, 95.2% and 81.5%, respectively.

CONCLUSION: The T + W + C and W + C models rendered a similar accuracy of the point prediction of age in both validation samples. However, they bared a larger risk of wrongfully categorising a minor as an adult than reported for internal validation, stressing the importance of the prediction interval for age estimation in practice.

PMID:40289208 | DOI:10.1007/s00414-025-03499-x

Categories
Nevin Manimala Statistics

Association between lean body mass and osteoarthritis: a cross-sectional study from the NHANES 2007-2018

Sci Rep. 2025 Apr 27;15(1):14726. doi: 10.1038/s41598-025-98795-2.

ABSTRACT

The prevention of osteoarthritis through controlling body measurements has received increasing attention in recent years, but the relationship between lean body mass (LBM) and osteoarthritis remains unclear. Hence, we explored this association through the data from the National Health and Nutrition Examination Survey (2007-2018). The present study enrolled 31,172 participants. To explore the correlation between LBM and osteoarthritis, we utilized logistic regression equations to explore associations between covariates, exposures, and outcomes. We used multivariate regression models to further control confounding factors. Restricted cubic splines were employed to investigate non-linear relationships. And the inflection point was identified by recursive algorithm. Furthermore, stratified analyses of gender and age were conducted. Osteoarthritis was negatively correlated with LBM [0.74 (0.67, 0.80)] in the model adjusting for all confounders. Based on the restricted cubic spline curve, an inflection point of 52.26 kg was found to confirm the non-linear relationship. LBM was negatively correlated with osteoarthritis at 0.56 (0.48, 0.64) before the inflection point, and the association was not statistically significant afterward. This large-scale study revealed that LBM was non-linearly correlated with osteoarthritis in the general American population. Differences in age and gender were further identified.

PMID:40289200 | DOI:10.1038/s41598-025-98795-2

Categories
Nevin Manimala Statistics

Scale validation and prediction of environmental health literacy in Brazil

Sci Rep. 2025 Apr 27;15(1):14703. doi: 10.1038/s41598-025-98435-9.

ABSTRACT

Environmental Health Literacy (EHL) focuses on significant impact of environmental factors on human health and emphasizes the importance of public awareness and engagement in identifying and mitigating environmental health risks. This paper presents a study in the Distrito Federal, Brazil, aimed at evaluating EHL and identify the main socioeconomic characteristics capable of predicting EHL levels. Using the EHL Scale, which assesses knowledge, attitudes, and behaviors toward environmental health, this research applies a questionnaire to 397 respondents. Through descriptive statistics and confirmatory and exploratory factor analyses, the study validates the scale for the Brazilian context and offers structural adjustments for the air scale. Using the socioeconomic data we implemented a predictive Random Forest algorithm to forecast EHL levels on each of the scales. By extracting the Shapley values from the model, we established the most relevant variables to predict EHL, offering valuable insights for policymakers, health and environmental professionals to enhance public engagement with environmental health issues. The results indicate that social vulnerability features are predictive of EHL, including education, income, age, ethnicity, presence of disability and use of continuous medication. This study identifies factors that bolster policy strategies to communicate environmental health risks and promote behavior change regarding the environment and self-care.

PMID:40289196 | DOI:10.1038/s41598-025-98435-9

Categories
Nevin Manimala Statistics

Intratumoral and peritumoral radiomics signature based on DCE-MRI can distinguish between luminal and non-luminal breast cancer molecular subtypes

Sci Rep. 2025 Apr 27;15(1):14720. doi: 10.1038/s41598-025-98155-0.

ABSTRACT

Distinguishing the luminal subtypes of breast cancer (BC) remaining challenging. Thus, the aim of this study was to investigate the feasibility of radiomic signature using intratumoral and peritumoral features obtained from dynamic contrast-enhanced MRI (DCE-MRI) in preoperatively discriminating the luminal from non-luminal type in patients with BC. A total of 305 patients with pathologically confirmed BC from three hospitals were retrospectively enrolled. The LASSO method was then used for selecting features, and the radiomic score (radscore) for each patient was calculated. Based on the radscore, Radiomic signature of intratumoral, peritumoral, and combined intratumoral and peritumoral were established, respectively. The performances of the radiomic signatures were validated with receiver operator characteristic (ROC) curve and decision curve analysis. For predicting molecular subtypes, the AUC for intratumoral radiomic signature was 0.817, 0.838, and 0.883 in the training set, internal validation set, and external validation set, respectively. AUC for the peritumoral radiomic signature was 0.863, 0.895, and 0.889 in the training set, internal validation set, and external validation set, respectively. The AUC for combined intratumoral and peritumoral radiomic signature was 0.956, 0.945, and 0.896 in the training set, internal validation set, and external validation set, respectively. Additional contributing value of combined intratumoral and peritumoral radiomic signature to the intratumoral radiomic signature was statistically significant [NRI, 0.300 (95% CI: 0.117-0.482), P = 0.001 in internal validation set; NRI, 0.224 (95% CI: 0.038-0.410), P = 0.018 in external validation set]. These results indicated that the radiomic signature combining intratumoral and peritumoral features showed good performance in predicting the luminal type of breast cancer.

PMID:40289183 | DOI:10.1038/s41598-025-98155-0

Categories
Nevin Manimala Statistics

A comparison of statistical methods for deriving occupancy estimates from machine learning outputs

Sci Rep. 2025 Apr 27;15(1):14700. doi: 10.1038/s41598-025-95207-3.

ABSTRACT

The combination of autonomous recording units (ARUs) and machine learning enables scalable biodiversity monitoring. These data are often analysed using occupancy models, yet methods for integrating machine learning outputs with these models are rarely compared. Using the Yucatán black howler monkey as a case study, we evaluated four approaches for integrating ARU data and machine learning outputs into occupancy models: (i) standard occupancy models with verified data, and false-positive occupancy models using (ii) presence-absence data, (iii) counts of detections, and (iv) continuous classifier scores. We assessed estimator accuracy and the effects of decision threshold, temporal subsampling, and verification strategies. We found that classifier-guided listening with a standard occupancy model provided an accurate estimate with minimal verification effort. The false-positive models yielded similarly accurate estimates under specific conditions, but were sensitive to subjective choices including decision threshold. The inability to determine stable parameter choices a priori, coupled with the increased computational complexity of several models (i.e. the detection-count and continuous-score models), limits the practical application of false-positive models. In the case of a high-performance classifier and a readily detectable species, classifier-guided listening paired with a standard occupancy model provides a practical and efficient approach for accurately estimating occupancy.

PMID:40289178 | DOI:10.1038/s41598-025-95207-3

Categories
Nevin Manimala Statistics

Direct brain recordings reveal implicit encoding of structure in random auditory streams

Sci Rep. 2025 Apr 27;15(1):14725. doi: 10.1038/s41598-025-98865-5.

ABSTRACT

The brain excels at processing sensory input, even in rich or chaotic environments. Mounting evidence attributes this to sophisticated internal models of the environment that draw on statistical structures in the unfolding sensory input. Understanding how and where such modeling proceeds is a core question in statistical learning and predictive processing. In this context, we address the role of transitional probabilities as an implicit structure supporting the encoding of the temporal structure of a random auditory stream. Leveraging information-theoretical principles and the high spatiotemporal resolution of intracranial electroencephalography, we analyzed the trial-by-trial high-frequency activity representation of transitional probabilities. This unique approach enabled us to demonstrate how the brain automatically and continuously encodes structure in random stimuli and revealed the involvement of a network outside of the auditory system, including hippocampal, frontal, and temporal regions. Our work provides a comprehensive picture of the neural correlates of automatic encoding of implicit structure that can be the crucial substrate for the swift detection of patterns and unexpected events in the environment.

PMID:40289162 | DOI:10.1038/s41598-025-98865-5

Categories
Nevin Manimala Statistics

Topological links and knots of speckled light mediated by coherence singularities

Light Sci Appl. 2025 Apr 27;14(1):175. doi: 10.1038/s41377-025-01865-3.

ABSTRACT

Links and knots are exotic topological structures that have garnered significant interest across multiple branches of natural sciences. Coherent links and knots, such as those constructed by phase or polarization singularities of coherent light, have been observed in various three-dimensional optical settings. However, incoherent links and knots-knotted or connected lines of coherence singularities-arise from a fundamentally different concept. They are “hidden” in the statistic properties of a randomly fluctuating field, making their presence often elusive or undetectable. Here, we theoretically construct and experimentally demonstrate such topological entities of incoherent light. By leveraging a state-of-the-art incoherent modal-decomposition scheme, we unveil incoherent topological structures from fluctuating light speckles, including Hopf links and Trefoil knots of coherence singularities that are robust against coherence and intensity fluctuations. Our work is applicable to diverse wave systems where incoherence or practical coherence is prevalent, and may pave the way for design and implementation of statistically-shaped topological structures for various applications such as high-dimensional optical information encoding and optical communications.

PMID:40289134 | DOI:10.1038/s41377-025-01865-3

Categories
Nevin Manimala Statistics

Aetiological factors in molar incisor hypomineralisation: a case-control study from Salamanca, Spain

Ital J Pediatr. 2025 Apr 27;51(1):129. doi: 10.1186/s13052-025-01972-2.

ABSTRACT

BACKGROUND: Molar incisor hypomineralisation (MIH) is a developmental dental condition that causes defects in the enamel of the first molars and permanent incisors. The aim of the present study was to assess possible causal correlations between the mother-child dyad medical history and MIH.

METHODS: An observational, retrospective, case‒control pilot study was carried out at the Dental Clinic of the University of Salamanca. This study was conducted between November 2023 and May 2024. Data on potential aetiological factors were collected through personal interviews, and the children’s parents were asked aetiological questions. Statistical analysis was performed with Student’s t test and the chi-square test.

RESULTS: A total of 140 children were enrolled in the study. The case group included 70 children with MIH (31 boys and 39 girls; mean age: 9.1 ± 2.32 years), while the control group comprised 70 children without MIH (32 boys and 38 girls; mean age: 9.57 ± 3.09 years). Among the factors assessed, maternal drug allergies during pregnancy and childhood asthma were identified as potential aetiological contributors to MIH, both showing statistically significant associations (p < 0.01).

CONCLUSIONS: Within the limitations of this pilot case-control study, a potential association was observed between MIH and both maternal drug allergies during pregnancy and childhood asthma. These findings support the need for further investigation into prenatal and early-life factors that may contribute to enamel developmental disturbances. Larger prospective studies are recommended to confirm these associations and better understand the underlying mechanisms.

PMID:40289126 | DOI:10.1186/s13052-025-01972-2

Categories
Nevin Manimala Statistics

Comparative study of brain functional imaging of brain in patients with mild to moderate Alzheimer’s disease based on functional near infrared spectroscopy

BMC Neurol. 2025 Apr 28;25(1):186. doi: 10.1186/s12883-024-03989-2.

ABSTRACT

OBJECTIVE: Based on the near-infrared functional brain imaging system, this research studied the hemoglobin concentration signal in resting state and task state. The purpose of this research was to analyze the activated brain regions and functional connections by exploring the changes in hemoglobin concentration and the differences in brain network functional connections between healthy people and mild to moderate AD patients. So as to identify the cognitive dysfunction of patients at an early stage. By accurately locating the area of cognitive impairment in patients, it provides a basis for precise neural regulation of physical therapy.

METHODS: Patients who came to our hospital from January 2022 to December 2022 were recruited and selected according to the exclusion criteria. After receiving their informed consent, MMSE scale examination and near-infrared brain function imaging examination were performed in a relatively quiet environment.

RESULT: A total of 24 subjects were included in this study, including 7 in the control group (age: 72.57 ± 7.19) and 17 (age: 76.88 ± 9.29) in the AD group. The average cognitive scores were (28.00 ± 1.16), (19.24 ± 4.89), respectively. There were no statistically significant differences in gender, years of education, age, and past medical history between the AD group and the control group (P > 0.05). In the verbal fluency test (VFT) task, there was a significant difference in the activation values of the two groups in channels 01, 06, 07, 09, 13, 14, 15, 16, 19, 21, 22, 23, 27, 29, 31, 35, 38, 40, 43, 44, 45, 51, and 52II (p < 0.05), and the activation values of the normal group were greater than those of the AD group. There was a significant difference in the mean oxygenated hemoglobin concentration in channels 01, 07, 15, 16, 21, 22, 23, 31, 35, 40, 41, 44, and 48 (p < 0.05), and the average oxygenated hemoglobin concentration in the AD group was lower than that in the normal group. There was no significant difference in activation speed between the two groups. In the resting state, the number of total network edges, DLPFC-L to PreM and SMC-L, DLPFC-L to FEF-L, DLPFC-L to DLPFC-L, FPA-L to PreM and SMC-L, FPA-L to FPA-L, FPA-R to FPA-L, DLPFC-L to DLPFC-R, FEF-R to PreM and There was a statistically significant difference in the number of network edges in SMC-L (p < 0.05). Among the different groups, the number of network edges in the AD group was smaller than that in the normal group. Correlation analysis showed that T14, T31, T16, T23, T27, M16, M22, M41 (T: represents activation value, M: represents mean hemoglobin concentration, and number represents channel number). There was a positive correlation between the total number of network edges, DLPFC-L to PreM and SMC-L, DLPFC-L to DLPFC-L, FPA-L to PreM and SMC-L, FPA-L to FPA-L, DLPFC-L to DLPFC-R, FEF-R to PreM and SMC-L, and MMSE scores (p < 0.05).

DISCUSSION: In this study, we found hemodynamic changes in the prefrontal lobes of AD patients under the VFT task, and the decrease in the functional connectivity of the prefrontal brain network in AD patients in the resting state, and these changes were associated with cognitive decline in patients. Our findings suggest that fNIRS may be used as a tool for future clinical screening for cognitive impairment, and may also be used to develop personalized preventive measures and treatment plans through accurate assessment.

PMID:40289104 | DOI:10.1186/s12883-024-03989-2