Categories
Nevin Manimala Statistics

Assessing and mitigating demographic bias in large language models for diagnostic radiology

Jpn J Radiol. 2026 Jun 5. doi: 10.1007/s11604-026-02021-6. Online ahead of print.

ABSTRACT

PURPOSE: Large language models (LLMs) are increasingly integrated into radiology workflows, but their demographic biases have not been evaluated in diagnostic radiology. This study aimed to investigate racial and sex biases in the diagnostic performance of LLMs (text-only and vision models) in radiology and to evaluate whether prompting strategies mitigate these biases.

MATERIALS AND METHODS: This retrospective study included consecutive Diagnosis Please cases published in Radiology from April 1998 to October 2024, excluding cases with sex-specific diseases. For each case, eight race-sex scenarios were generated by altering four race/ethnicity categories (Asian, Black, Hispanic, White) and two sex categories (male, female). Three LLMs (GPT-5, Claude Sonnet 4.5, Gemini 2.5 Flash) were evaluated as text-only models (medical history and imaging findings) and vision models (medical history and images) using three prompting strategies (basic, self-consistency, chain-of-thought prompting). Generalized estimating equations were used to compare diagnostic accuracy across race/ethnicity, sex, and prompting strategies.

RESULTS: A total of 286 cases were included. Across models and conditions, ten significant race-related and four significant sex-related differences in diagnostic accuracy were observed. Among the four race/ethnicity groups, Black patients were most likely to have significantly lower accuracy (four of the ten statistically significant race-related comparisons [40%]) and least likely to have significantly higher accuracy (1/10 [10%]). Vision models with female patients under basic prompting showed a higher number of significant race-related differences (6/10 [60%]) than vision models with male patients and all text-only models. Text-only and vision models showed no statistically significant differences in diagnostic accuracy across prompting strategies (p = 0.78 and 0.95, respectively); basic and self-consistency prompting produced ten and four significant race- or sex-related differences, respectively, whereas no significant differences were observed with chain-of-thought prompting.

CONCLUSION: Large language models exhibited racial and sex biases in diagnostic radiology, and chain-of-thought prompting may help mitigate these biases.

PMID:42247082 | DOI:10.1007/s11604-026-02021-6

Categories
Nevin Manimala Statistics

Thoughts of Death or Self-injury, Non-same-day ART Initiation, and 2-year Incidence of Disengagement from Care Among People Entering HIV Care in Cameroon

AIDS Behav. 2026 Jun 5. doi: 10.1007/s10461-026-05149-8. Online ahead of print.

ABSTRACT

Thoughts of death or self-injury and the clinical implications of such thoughts remain largely underassessed among people with HIV (PWH) in Africa. As strong predictors of suicidal ideation and death by suicide, it is paramount to understand these risk indicators, particularly in populations with heightened susceptibility to poor mental health. We aimed to characterize thoughts of death or self-injury (i.e., self-injurious thoughts) and their relationship with non-same-day (i.e., delayed) anti-retroviral treatment (ART) initiation and longitudinal disengagement from clinic in a cohort of PWH newly entering HIV care in Cameroon. We conducted structured interviews with PWH aged 21 + initiating clinical care between June 2019 and March 2020. Clinical records were used to ascertain ART initiation date and disengagement from the clinic across two years following care initiation. Log binomial regression was used to estimate the association between self-injurious thoughts at care initiation and delayed ART initiation. A Fine and Gray sub-distribution proportional hazards model was used to quantify the cumulative incidence of disengagement from the clinic (i.e., a gap in clinic visits > 183 days) and differences in these estimates across groups. Of 426 enrolled individuals, seventy-one (16.7%) endorsed self-injurious thoughts at entry into care, 24 (33.8%) of whom had active thoughts of self-injury. Self-injurious thoughts at entry into care were positively associated with delayed ART initiation (adjusted prevalence ratio = 1.8; 95% CI: 1.2, 2.9) and cumulative incidence of disengagement from clinic, though differences in disengagement were statistically non-significant. Interventions are urgently needed to support the mental health and well-being of PWH entering care and improve down-stream HIV treatment outcomes.

PMID:42247071 | DOI:10.1007/s10461-026-05149-8

Categories
Nevin Manimala Statistics

Gallic acid attenuates LPS-induced acute kidney injury with accompanying changes in inflammatory and redox-related signaling pathways

Mol Biol Rep. 2026 Jun 5;53(1):893. doi: 10.1007/s11033-026-12056-7.

ABSTRACT

BACKGROUND: Lipopolysaccharide (LPS)-induced sepsis is one of the leading causes of acute kidney injury (AKI). Gallic acid (GAL), a natural polyphenolic compound, exhibits potent antioxidant, anti-inflammatory, and cytoprotective properties. This study aimed to investigate the renoprotective effects of GAL.

METHODS: Thirty-two adult male Wistar rats were randomly divided into four groups (n = 8): Control, LPS (5 mg/kg, i.p.), LPS + GAL (100 mg/kg i.p. administered immediately before LPS), representing a preventive treatment design and GAL-alone. Renal tissues were examined histologically using hematoxylin-eosin staining and toll-like receptor 4(TLR4), nuclear factor kappa B(NF-κB), and interleukin-1 beta (IL-1β) immunoexpression by immunohistochemical staining. mRNA expression levels of sirtuin 1(SIRT1), adenosine monophosphate-activated protein kinase(AMPK), forkhead box O3(FOXO3), phosphoinositide 3-kinase (PI3K), protein kinase B(AKT1), glutathione peroxidase 4(GPX4), aquaporin 4 (AQP4), glycogen synthase kinase-3 beta(GSK3β) were assessed. Serum creatinine (CRE) and urea levels were analyzed to determine renal functional status.

RESULTS: LPS administration caused marked renal injury, characterized by severe hyperemia, hemorrhage, tubular necrosis, and increased TLR4, NF-κB, and IL-1β expression. GAL treatment attenuated these changes, with a significant reduction only in hyperemia scores, while decreases in hemorrhage, neutrophil infiltration, and necrosis remained nonsignificant. GAL administration was associated with reversal of the LPS-related decrease in PI3K, AKT1, AMPK, FOXO3, SIRT1, GPX4 and AQP4 mRNA expression, along with reduced GSK3β mRNA expression. Serum CRE levels improved with GAL, whereas the urea reduction was not statistically significant.

CONCLUSION: GAL showed a renoprotective effect in this preventive model of LPS-induced AKI and was associated with reduced inflammatory marker expression together with favorable changes in PI3K/AKT1/FOXO3/SIRT1-related mRNA profiles. These findings suggest that GAL may have prophylactic against early endotoxin-associated renal injury.

PMID:42247065 | DOI:10.1007/s11033-026-12056-7

Categories
Nevin Manimala Statistics

Light Scattering Characterization of Bacterial Morphology in Aqueous Suspension

J Phys Condens Matter. 2026 Jun 4. doi: 10.1088/1361-648X/ae7886. Online ahead of print.

ABSTRACT

We present a combined light-scattering methodology for the ensemble-averaged characterization of bacterial morphology and size distributions directly in aqueous suspension. The approach integrates heterodyne near-field scattering with three-dimensional cross-correlation dynamic light scattering, providing continuous access to the scattered intensity over an extended range of scattering vectors. This configuration enables simultaneous probing of length scales associated with both the axial and radial dimensions of bacterial cells. The measured intensity profiles exhibit form-factor features characteristic of spherocylindrical particles, including oscillatory behavior arising from finite particle geometry. To interpret the experimental data, the bacterial population is modeled as a polydisperse ensemble of randomly oriented spherocylinders. Orientational averaging and population heterogeneity are explicitly incorporated into the scattering model, allowing quantitative extraction of distributions of cell length and radius for multiple bacterial strains. The calculated form factors show good agreement with experimental measurements across the full scattering range, demonstrating sensitivity to both axial and transverse polydispersity. These results establish the combined scattering framework as a statistically robust method for characterizing bacterial morphology in suspension. The approach complements microscopy-based measurements and provides a quantitative experimental basis for physical modeling of bacterial systems relevant to biomedical, environmental, and industrial contexts.

PMID:42246133 | DOI:10.1088/1361-648X/ae7886

Categories
Nevin Manimala Statistics

The Relationship Between Internalized Stigma and Well-Being, Quality of Life, Depression and Perceived Social Support in Older Adults Attending an Internal Medicine Outpatient Clinic

Psychogeriatrics. 2026 Jul;26(4):e70187. doi: 10.1111/psyg.70187.

ABSTRACT

BACKGROUND: Population ageing is accompanied by increasing exposure to psychosocial challenges, including age-related stereotypes and stigma. Internalized age-related stigma has emerged as an important psychosocial factor associated with mental health and well-being among older adults. However, empirical evidence examining its associations with well-being, quality of life, depressive symptoms and perceived social support remains limited, particularly in non-Western cultural contexts.

METHODS: This cross-sectional quantitative study included 139 adults aged 65 years and older attending an internal medicine outpatient clinic. Cognitive eligibility was assessed using the revised Turkish version of the Mini-Mental State Examination. Data were collected through face-to-face interviews. Measures included the Internalized Stigma of Ageing Scale (ISAS), WHO-5 Well-Being Index, Geriatric Depression Scale (GDS-15), WHO Quality of Life-AGE (WHOQOL-AGE), Multidimensional Scale of Perceived Social Support (MSPSS), and the Attitudes Towards Ageing Questionnaire. Statistical analyses comprised t-tests, one-way analysis of variance, Pearson correlation and hierarchical multiple linear regression.

RESULTS: Participants demonstrated moderate levels of internalized age-related stigma. Internalized stigma was significantly higher among participants who were unemployed and those living alone or with adult children. Internalized stigma was negatively associated with well-being, quality of life and perceived social support, and positively associated with depressive symptoms. Regression analyses indicated that lower well-being, poorer quality of life, higher depressive symptoms and lower perceived social support were significantly associated with higher internalized stigma.

CONCLUSIONS: Internalized age-related stigma is a significant psychosocial correlate of mental well-being and quality of life in later life. Interventions targeting psychological well-being and social support may play a role in mitigating stigma-related vulnerability in later life.

PMID:42246113 | DOI:10.1111/psyg.70187

Categories
Nevin Manimala Statistics

How Medicare Advantage Enrollment Affects Spending in Traditional Medicare: Spillovers and Implications for MA Benchmarks

J Health Polit Policy Law. 2026 Jun 5:12650571. doi: 10.1215/03616878-12650571. Online ahead of print.

ABSTRACT

CONTEXT: Arguably, the most pressing issues in Medicare financing today center on Medicare Advantage (MA). Rising MA enrollment may lower TM spending via a spillover effect wherein cost-efficient practices in MA are applied to TM enrollees. At the same time, growing MA enrollment results in a shrinking share of TM enrollees, raising concerns about how MA plans are reimbursed, since MA payments and rebates depend, in part, on average TM spending.

METHODS: This study estimates the effect of MA enrollment shares on per-enrollee TM spending using county-level CMS data. TM spending is the product of three factors: a local price index for TM services, average price- and risk-adjusted spending per enrollee (“quantity” of services used), and average TM risk scores. We estimate the effect of MA enrollment shares on each factor.

FINDINGS: Increased MA enrollment is associated with a statistically significant reduction in TM price- and risk-adjusted spending. MA enrollment is linked to reduced outpatient service use and reduced use of procedures, tests, imaging, durable medical equipment, and ambulance services among TM enrollees.

CONCLUSIONS: Reduced spending in TM from rising MA enrollment is a form of Medicare savings that offsets, to a small degree, the substantial estimated overpayments made to MA plans.

PMID:42246093 | DOI:10.1215/03616878-12650571

Categories
Nevin Manimala Statistics

Modelling the risk of West Nile virus infection in seven European countries from published serological and case notification data, 2008 to 2022

Euro Surveill. 2026 Jun;31(22). doi: 10.2807/1560-7917.ES.2026.31.22.2500394.

ABSTRACT

BACKGROUNDWest Nile virus (WNV) is a zoonotic mosquito-borne pathogen increasingly reported in Europe.AIMWe aimed to characterise heterogeneities in the average annual human risk of WNV infection (force of infection, FOI) and in WNV surveillance across Europe.METHODSWe conducted a systematic review following the PRISMA guidelines to identify serological studies on WNV in humans with IgG-based assays in Europe. We then used mathematical models fitted to both age-stratified serosurvey and case data to reconstruct spatially explicit FOI estimates, the sensitivity of syndromic surveillance and age-dependent trends in case reporting.RESULTSWe extracted 92 serosurvey datasets from 21 countries. Based on 10 age-stratified serosurvey datasets from Greece, Hungary, Italy, Romania and Spain and case data from seven countries (Austria, Cyprus, Greece, Hungary, Italy, Romania and Spain), we estimated the WNV FOI for 119 European nomenclature of territorial units for statistics level (NUTS) 0-3 regions. We found evidence of spatial heterogeneities in transmission intensity and estimated that on average less than 0.2% of human WNV infections were notified, with country variability and age-dependent trends in the propensity of reporting WNV disease.CONCLUSIONThis study shows that the intensity of WNV transmission, the average annual incidence of infection and the sensitivity of surveillance are heterogeneous across Europe. Due to differences in case reporting across countries, the incidence of reported WNV cases does not necessarily reflect the same proportion of WNV infections and hence the actual infection incidence, which highlights the importance of conducting WNV seroprevalence surveys.

PMID:42246068 | DOI:10.2807/1560-7917.ES.2026.31.22.2500394

Categories
Nevin Manimala Statistics

Robust Estimation of Population Attributable Fractions in the Presence of Multiple Ordered Mediators

Stat Med. 2026 Jun;45(13-14):e70636. doi: 10.1002/sim.70636.

ABSTRACT

Population Attributable Fraction (PAF) is a key epidemiological measure used to quantify the contribution of risk factors to the overall disease burden. However, when an exposure affects an outcome through multiple ordered mediators, traditional PAF estimation methods face challenges in accurately identifying the impact of each mediating pathway. These challenges arise from mediator-outcome relationships, interactions among mediators, and the presence of potential confounders. In this study, we propose new measures, termed mPAFs, to quantify the fraction of disease attributable to a specific mediation pathway. The proposed framework incorporates a multiply robust estimator that yields consistent estimates of mPAFs provided that at least two of the three types of models are correctly specified: the exposure models, mediator models, or outcome model. The asymptotic properties of the estimator are formally established, and a comprehensive simulation study is conducted to demonstrate its robustness against model misspecification. In a real-data application using TCGA lung cancer cohorts, we analyzed the effect of smoking on mortality mediated through TTK and MAD2L1. In lung adenocarcinoma, the total PAF was estimated at 4.45%, with a direct effect of 1.82% and pathway-specific contributions of -1.95% (TTK) and 0.68% (MAD2L1). In contrast, lung squamous cell carcinoma showed a higher total PAF of 10.43%, with most of the effect attributable to the direct pathway (10.22%), suggesting minimal mediation via the selected genes.

PMID:42246056 | DOI:10.1002/sim.70636

Categories
Nevin Manimala Statistics

Trajectories and Adaptive Recalibration of Self-Efficacy in Brazilian Medical Students: A Longitudinal Study With Implications for Educational Support

Clin Teach. 2026 Aug;23(4):e70448. doi: 10.1111/tct.70448.

ABSTRACT

INTRODUCTION: Self-efficacy, the belief in one’s capacity to execute actions necessary to achieve specific objectives, is a significant predictor of academic performance and well-being in medical training. Longitudinal studies examining the dynamic evolution of self-efficacy throughout undergraduate medical education remain scarce in the Brazilian context. This study prospectively evaluated perceived self-efficacy among medical students during critical curricular transitions.

METHODS: A prospective longitudinal study followed 50 medical students from a Brazilian public university at four time points: course entry (T1-2021), end of first year (T2-2022), midpoint of clinical cycle (T3-2023) and beginning of fifth year/internship (T4-2025). The Self-Efficacy Scale for Higher Education (SESHE), a validated 34-item instrument assessing five dimensions, was administered. Statistical analyses included repeated measures ANOVA, Wilcoxon test and effect size calculations (Cohen’s d).

RESULTS: The study observed a dynamic evolution in perceived self-efficacy, characterised by a notable shift in total self-efficacy from 2021 to 2023, followed by partial recovery by 2025. This pattern included significant reductions in several dimensions. Self-efficacy in proactive actions experienced the most pronounced change (d > 0.8), while self-efficacy in social interaction demonstrated relative stability. Baseline analyses revealed higher self-efficacy in social interaction among male students (d = 0.51).

CONCLUSION: These findings suggest that the observed changes in self-efficacy represent an adaptive recalibration, moving toward a more realistic self-assessment essential for professional development. Therefore, strategies that support this healthy recalibration are crucial. These interventions should aim to facilitate adaptive adjustment, fostering robust and accurate self-efficacy throughout medical training.

PMID:42246054 | DOI:10.1111/tct.70448

Categories
Nevin Manimala Statistics

Identifying key predictors of post-stroke depression and cognitive impairment in acute stroke survivors

Front Neurol. 2026 May 20;17:1636511. doi: 10.3389/fneur.2026.1636511. eCollection 2026.

ABSTRACT

BACKGROUND: Post-stroke depression (PSD) and post-stroke cognitive impairment (PSCI) are prevalent complications in aging stroke survivors and are often overlooked due to the lack of early diagnostic indicators, leading to poor prognosis. Identifying reliable predictors is crucial for timely intervention.

METHODS: This prospective cohort study followed 78 acute stroke survivors for 6 months. A composite neuropsychological outcome-defined as the development of PSD and/or PSCI-was determined using the Diagnostic and Statistical Manual of Mental Disorders-5th Edition (DSM-5) and NINDS-CSN criteria. To account for the limited sample size, multivariable Firth’s penalized logistic regression was employed to identify independent predictors, generating robust odds ratios (ORs) and 95% confidence intervals (CIs). An exploratory classification and regression tree (CART) analysis was also conducted for hypothesis generation.

RESULTS: The final cohort comprised 78 acute ischemic stroke survivors with a median age of 62 years (IQR 51-71). Among these participants, 26.0% were women, and the median admission score on the National Institutes of Health Stroke Scale (NIHSS) was 3 (IQR 1-5). Within 6 months, 56 patients (71.8%) developed the composite outcome (13 experienced PSCI alone, 24 had PSD alone, and 19 had both conditions). A multivariable analysis revealed that right hemisphere lesions (OR = 9.019, 95% CI: 1.329-61.213, p = 0.016), greater baseline emotional distress (higher 9-item Patient Health Questionnaire (PHQ-9) scores; OR = 5.157, 95% CI: 1.835-14.494, p < 0.001), and pre-existing cognitive vulnerability (lower Mini-Mental State Examination (MMSE) scores; OR = 0.714, 95% CI: 0.517-0.984, p = 0.023) were independent predictors of poor neuropsychological outcomes. Advanced age (p = 0.094) and elevated urea levels (p = 0.095) showed only marginal trends. Exploratory CART modeling highlighted the hierarchical interaction of these baseline clinical scores for risk stratification.

CONCLUSION: Right hemisphere lesions, early emotional distress, and baseline cognitive vulnerability independently predicted a high risk of composite neuropsychological impairment at 6 months post-stroke. Rather than serving merely as novel biomarkers, high baseline PHQ-9 scores and low MMSE scores reflected the persistence of early distress and poor cognitive reserve, respectively. These highly accessible clinical parameters facilitate early risk stratification, emphasizing the absolute need for immediate psychological triage and integrated, long-term cognitive-emotional monitoring.

PMID:42246052 | PMC:PMC13229621 | DOI:10.3389/fneur.2026.1636511