Categories
Nevin Manimala Statistics

Paired-Sample and Pathway-Anchored MLOps Framework for Robust Transcriptomic Machine Learning in Small Cohorts: Model Classification Study

JMIR Bioinform Biotechnol. 2025 Oct 8;6:e80735. doi: 10.2196/80735.

ABSTRACT

BACKGROUND: Approximately 90% of the 65,000 human diseases are infrequent, collectively affecting ~400 million people, substantially limiting cohort accrual. This low prevalence constrains the development of robust transcriptome-based machine learning (ML) classifiers. Standard data-driven classifiers typically require cohorts of more than 100 participants per group to achieve clinical accuracy while managing high-dimensional input (~25,000 transcripts). These requirements are infeasible for microcohorts of ~20 individuals, where overfitting becomes pervasive.

OBJECTIVE: To overcome these constraints, we developed a classification method that integrates three enabling strategies: (i) paired-sample transcriptome dynamics, (ii) N-of-1 pathway-based analytics, and (iii) reproducible machine learning operations (MLOps) for continuous model refinement.

METHODS: Unlike ML approaches relying on a single transcriptome per subject, within-subject paired-sample designs-such as pre- versus post-treatment or diseased versus adjacent-normal tissue-effectively control intraindividual variability under isogenic conditions and within-subject environmental exposures (eg, smoking history, other medications, etc), improve signal-to-noise ratios, and, when pre-processed as single- studies (N-of-1), can achieve statistical power comparable with that obtained in animal models. Pathway-level N-of-1 analytics further reduces each sample’s high-dimensional profile into ~4000 biologically interpretable features, annotated with effect sizes, dispersion, and significance. Complementary MLOp practices-automated versioning, continuous monitoring, and adaptive hyperparameter tuning-improve model reproducibility and generalization.

RESULTS: In two case studies of distinct diseases, human rhinovirus infection (HRV) versus matched healthy controls (n=16 training; n=3 test) and breast cancer tissues harboring TP53 or PIK3CA mutations versus adjacent normal tissue (n=27 training; n=9 test)-this approach achieved 90% precision and recall on an unseen breast cancer test set and 92% precision with 90% recall in rhinovirus fivefold cross-validation. Incorporating paired-sample dynamics boosted precision by up to 12% and recall by 13% in breast cancer and by 5% each in HRV. MLOps workflows yielded an additional ~14.5% accuracy improvement compared to traditional pipelines. Moreover, our method identified 42 critical gene sets (pathways) for rhinovirus response and 21 for breast cancer mutation status, selected as the most important features (mean decrease impurity) of the best-performing model, with retroactive ablation of top 20 features reducing accuracy by ~25%.

CONCLUSIONS: These proof-of-concept results support the utility of integrating intrasubject dynamics, “biological knowledge”-based feature reduction (pathway-level feature reduction grounded in prior biological knowledge; eg, N-of-1-pathway analytics), and reproducible MLOp workflows can overcome cohort size limitations in infrequent disease, offering a scalable, interpretable solution for high-dimensional transcriptomic classification. Future work will extend these advances across various therapeutic and small cohort designs.

PMID:41342203 | DOI:10.2196/80735

Categories
Nevin Manimala Statistics

Potential metabolite biomarkers of drought tolerance in contrasting Sideroxylon spinosum L. ecotypes using a metabolomic approach

J Sci Food Agric. 2025 Dec 4. doi: 10.1002/jsfa.70365. Online ahead of print.

ABSTRACT

BACKGROUND: Sideroxylon spinosum L., an endemic Moroccan species, holds significant ecological and socioeconomic importance. Drought stress severely affects plant survival by disrupting metabolic and physiological processes. This study aimed to investigate drought-induced metabolic changes and identify potential biomarkers in two S. spinosum L. ecotypes that contrast primarily in their climatic origins – Aoulouz (Alz, inland) and Lakhssas (Lks, coastal) – using metabolomic analysis.

RESULTS: Gas chromatography-mass spectrometry analysis detected 700 and 600 peaks in Lks and Alz leaves, respectively, with 120 and 100 corresponding to known metabolites. Under drought stress conditions, statistical analysis (t-test, P < 0.05) identified 44 significant metabolites in Lks and 56 in Alz. According to the volcano plot (log₁₀(P) versus log₂ fold change), 34 metabolites were upregulated and 10 downregulated in Lks, whereas 25 were upregulated and 31 downregulated in Alz. The criteria for significance included a fold change ≥ 2.0 and false discovery rate < 0.05. Multivariate analyses showed clear separation between control and drought-stressed samples. Based on variable importance in projection scores and receiver operating characteristic curve analyses, ten potential drought-tolerance biomarkers were identified. In Alz, two upregulated metabolites (M65 (lupeol) and M102 (octadecane)) and three downregulated metabolites (M108 (octacosane), M123 (5-octadecene, E) and M200 (4-nitrobenzylidenenemalonic acid, diethyl ester)) were key. In contrast, Lks exhibited five upregulated biomarkers: M6 (hexadecanoic acid, methyl ester), M54 (1,3,6,10-cyclotetradecatetraene, 3,7,11-trimethyl-14-(1-methylethyl)), M88, M91 and M142. Metabolites M65 and M102 in Alz likely enhance cuticular integrity and reactive oxygen species scavenging, while M6 and M54 in Lks suggest a reliance on lipid signaling and energy metabolism for transient drought adaptation. However, the Lks ecotype could be less resilient under prolonged drought due to a greater ability to reallocation metabolism. These biomarkers offer valuable targets for breeding or biotechnological interventions.

CONCLUSION: This study offers valuable insights into the metabolic mechanisms involved in drought defense in S. spinosum L. and highlights specific biomarkers linked to drought tolerance. The Alz ecotype demonstrates enhanced resilience through cuticular reinforcement and oxidative stress mitigation, whereas the Lks ecotype relies on transient metabolic adjustments. These findings offer valuable insights and potential targets for improving drought tolerance in plants through future research. © 2025 Society of Chemical Industry.

PMID:41342197 | DOI:10.1002/jsfa.70365

Categories
Nevin Manimala Statistics

Single-session bilateral supine percutaneous nephrolithotomy: Safety and efficacy

Urologia. 2025 Dec 4:3915603251398256. doi: 10.1177/03915603251398256. Online ahead of print.

ABSTRACT

PURPOSE: To evaluate the safety and efficacy of single-session supine bilateral percutaneous nephrolithotomy (BPCNL) in patients with bilateral renal stones.

METHODS: We retrospectively identified patients from February 2019 to July 2023 with bilateral renal stones measuring >2 cm and <5 cm in their maximum dimension for each side mainly located in the renal pelvis that had been treated with single-session supine BPCNL. The stone-free rate was accepted when remaining fragments of ⩽ 2 mm were discovered by a computed tomography scan.

RESULTS: Fifty-two patients with bilateral renal stones measuring 2:5 cm in their maximum dimension for each side who had been treated with single-session supine BPCNL were included in the study; a statistically significant difference in serum creatinine level was detected on day 1 postoperatively (p < 0.0001) compared with the baseline values that became insignificant at 1 week and 1 month postoperatively (p = 0.403 and 0.471 respectively). Also, statistically significant difference in glomerular filtration rate was detected at day 1 postoperatively (p < 0.0001) compared with the baseline values that became insignificant at 1 week and 1 month postoperatively (p = 0.95 and 0.07 respectively implicating early renal affection that shortly returned to normal values. The mean operative time for both sides was 126.5 ± 22.9 min, and the mean hemoglobin drop after the procedure was 1.9 ± 0.99 g/dl. The primary stone-free rate was 75%, with 11.6% of the remaining patients had a residual insignificant stones >2 mm but still less than 6 mm. Finally, 13.4% of the patients needed ancillary procedures.

CONCLUSION: Single-session supine bilateral PCNL is both safe and effective for patients with bilateral renal stones. However, this is a complex procedure that should only be performed by expert surgeons in a tertiary centers.

TRIAL REGISTRATION NUMBER: (167) SPS/URS_008 retrospectively registered.

PMID:41342193 | DOI:10.1177/03915603251398256

Categories
Nevin Manimala Statistics

Protein-Protein Interactions in Papillary and Nonpapillary Urothelial Carcinoma Architectures: Comparative Study

JMIR Bioinform Biotechnol. 2025 Nov 27;6:e76736. doi: 10.2196/76736.

ABSTRACT

BACKGROUND: Bladder cancer is a disease characterized by complex perturbations in gene networks and is heterogeneous in terms of histology, mutations, and prognosis. Advances in high-throughput sequencing technologies, genome-wide association studies, and bioinformatics methods have revealed greater insights into the pathogenesis of complex diseases. Network biology-based approaches have been used to identify complex protein-protein interactions (PPIs) that can lead to potential drug targets. There is a need to better understand PPIs specific to urothelial carcinoma.

OBJECTIVE: This study aimed to elucidate PPIs specific to papillary and nonpapillary urothelial carcinoma and identify the most connected or “hub” proteins, as these are potential drug targets.

METHODS: A novel PPI analysis tool, Proteinarium, was used to analyze RNA sequencing data from 132 patients with papillary and 270 patients with nonpapillary urothelial carcinoma from the TCGA Cell 2017 dataset and 39 patients with papillary and 88 patients with nonpapillary urothelial carcinoma from the TCGA Nature 2014 dataset. Hub proteins were identified in distinct PPI networks specific to papillary and nonpapillary urothelial carcinoma. Statistical significance of clusters was assessed using the Fisher exact test (P<.001), and network separation was quantified using the interactome-based separation score.

RESULTS: RPS27A, UBA52, and VAMP8 were the most connected or “hub” proteins identified in the network specific to the papillary urothelial carcinoma. In the network specific to the nonpapillary carcinoma, GNB1, RHOA, UBC, and FPR2 were found to be the hub proteins. Notably, GNB1 and FPR2 were among the proteins that have existing drugs targeting them.

CONCLUSIONS: We identified distinct PPI networks and the hub proteins specific to papillary and nonpapillary urothelial carcinomas. However, these findings are limited by the use of transcriptomic data and require experimental validation to confirm the functional relevance of the identified targets.

PMID:41342186 | DOI:10.2196/76736

Categories
Nevin Manimala Statistics

Safety and effectiveness of tirzepatide during Ramadan fasting: Real-world evidence from patients with type 2 diabetes in Bangladesh

Diabetes Obes Metab. 2025 Dec 4. doi: 10.1111/dom.70343. Online ahead of print.

ABSTRACT

AIMS: Ramadan fasting poses challenges for patients with type 2 diabetes mellitus (T2DM) due to increased risks of hypoglycemia and metabolic fluctuations. Tirzepatide, a dual GIP/GLP-1 receptor agonist, has shown marked efficacy in glycemic control and weight reduction. This study aimed to evaluate the safety and effectiveness of tirzepatide among Bangladeshi patients with T2DM during Ramadan fasting.

METHODS: This prospective, multicentre, real-world evidence study included 109 adult patients with T2DM who intended to fast during Ramadan and were prescribed tirzepatide 2.5 mg weekly, either as monotherapy or in combination with other anti-hyperglycemic agents. Data on glycemic parameters, anthropometrics, blood pressure, lipid profile, renal and liver function were collected at 2-6 weeks before Ramadan and at 2-6 weeks after the end of Ramadan, along with incidences of adverse events. Statistical analysis was performed using SPSS 25.0.

RESULTS: The mean age of the study participants was 40.7 ± 12.8 (SD) years with female predominance (69.7%). About 86.7% of the participants were obese. The mean HbA1c significantly decreased from 7.6% (before Ramadan) to 6.5% (after Ramadan) (mean change: -1.1%; p <0.001). Fasting plasma glucose and 2-h postprandial glucose also showed significant reductions by -2 mmol/L and – 3.8 mmol/L, respectively (both p <0.001). Mean body weight reduction was 5.3 ± 3.9 kg (6.3% of baseline; p <0.001). Mild gastrointestinal events occurred in ~12% of participants, with no hypoglycemia reported.

CONCLUSION: Tirzepatide demonstrated significant improvements in glycaemic control and body weight, with good tolerability, among patients with type 2 diabetes in Bangladesh who fasted during Ramadan.

PMID:41342185 | DOI:10.1111/dom.70343

Categories
Nevin Manimala Statistics

Lung Cancer Diagnosis From Computed Tomography Images Using Deep Learning Algorithms With Random Pixel Swap Data Augmentation: Algorithm Development and Validation Study

JMIR Bioinform Biotechnol. 2025 Sep 3;6:e68848. doi: 10.2196/68848.

ABSTRACT

BACKGROUND: Deep learning (DL) shows promise for automated lung cancer diagnosis, but limited clinical data can restrict performance. While data augmentation (DA) helps, existing methods struggle with chest computed tomography (CT) scans across diverse DL architectures.

OBJECTIVE: This study proposes Random Pixel Swap (RPS), a novel DA technique, to enhance diagnostic performance in both convolutional neural networks and transformers for lung cancer diagnosis from CT scan images.

METHODS: RPS generates augmented data by randomly swapping pixels within patient CT scan images. We evaluated it on ResNet, MobileNet, Vision Transformer, and Swin Transformer models, using 2 public CT datasets (Iraq-Oncology Teaching Hospital/National Center for Cancer Diseases [IQ-OTH/NCCD] dataset and chest CT scan images dataset), and measured accuracy and area under the receiver operating characteristic curve (AUROC). Statistical significance was assessed via paired t tests.

RESULTS: The RPS outperformed state-of-the-art DA methods (Cutout, Random Erasing, MixUp, and CutMix), achieving 97.56% accuracy and 98.61% AUROC on the IQ-OTH/NCCD dataset and 97.78% accuracy and 99.46% AUROC on the chest CT scan images dataset. While traditional augmentation approaches (flipping and rotation) remained effective, RPS complemented them, surpassing the performance findings in prior studies and demonstrating the potential of artificial intelligence for early lung cancer detection.

CONCLUSIONS: The RPS technique enhances convolutional neural network and transformer models, enabling more accurate automated lung cancer detection from CT scan images.

PMID:41342173 | DOI:10.2196/68848

Categories
Nevin Manimala Statistics

Estimating Antigen Test Sensitivity via Target Distribution Balancing: Development and Validation Study

JMIR Bioinform Biotechnol. 2025 Oct 20;6:e68476. doi: 10.2196/68476.

ABSTRACT

BACKGROUND: Sensitivity-expressed as percent positive agreement (PPA) with a reference assay-is a primary metric for evaluating lateral-flow antigen tests (ATs), typically benchmarked against a quantitative reverse transcription polymerase chain reaction (qRT-PCR). In SARS-CoV-2 diagnostics, ATs detect nucleocapsid protein, whereas qRT-PCR detects viral RNA copy numbers. Since observed PPA depends on the underlying viral load distribution (proxied by the number of cycle thresholds [Cts], which is inversely related to load), study-specific sampling can bias sensitivity estimates. Cohort differences-such as enrichment for high- or low-Ct specimens-therefore complicate cross-test comparisons, and real-world datasets often deviate from regulatory guidance to sample across the full concentration range. Although logistic models relating test positivity to Ct are well described, they are seldom used to reweight results to a standardized reference viral load distribution. As a result, reported sensitivities remain difficult to compare across studies, limiting both accuracy and generalizability.

OBJECTIVE: The aim of this study was to develop and validate a statistical methodology that estimates the sensitivity of ATs by recalibrating clinical performance data-originally obtained from uncontrolled viral load distributions-against a standardized reference distribution of target concentrations, thereby enabling more accurate and comparable assessments of diagnostic test performance.

METHODS: AT sensitivity is estimated by modeling the PPA as a function of qRT-PCR Ct values (PPA function) using logistic regression on paired test results. Raw sensitivity is the proportion of AT positives among PCR-positive samples. Adjusted sensitivity is calculated by applying the PPA function to a reference Ct distribution, correcting for viral load variability. This enables standardized comparisons across tests. The method was validated using clinical data from a community study in Chelsea, Massachusetts, demonstrating its effectiveness in reducing sampling bias.

RESULTS: Over a 2-year period, paired ATs and qRT-PCR-positive samples were collected from 4 suppliers: A (n=211), B (n=156), C (n=85), and D (n=43). Ct value distributions varied substantially, with suppliers A and D showing lower Ct (high viral load) values in the samples, and supplier C skewed toward higher Ct values (low viral load). These differences led to inconsistent raw sensitivity estimates. To correct for this, we used logistic regression to model the PPA as a function of Cts and applied these models to a standardized reference Ct distribution. This adjustment reduced bias and enabled more accurate comparisons of test performance across suppliers.

CONCLUSIONS: We present a distribution-aware framework that models PPA as a logistic function of Ct and reweights results to a standardized reference Ct distribution to produce bias-corrected sensitivity estimates. This yields fairer, more consistent comparisons across AT suppliers and studies, strengthens quality control, and supports regulatory review. Collectively, our results provide a robust basis for recalibrating reported sensitivities and underscore the importance of distribution-aware evaluation in diagnostic test assessment.

PMID:41342172 | DOI:10.2196/68476

Categories
Nevin Manimala Statistics

Conversational Artificial Intelligence for Integrating Social Determinants, Genomics, and Clinical Data in Precision Medicine: Development and Implementation Study of the AI-HOPE-PM System

JMIR Bioinform Biotechnol. 2025 Oct 10;6:e76553. doi: 10.2196/76553.

ABSTRACT

BACKGROUND: Integrating clinical, genomic, and social determinants of health (SDOH) data is essential for advancing precision medicine and addressing cancer health disparities. However, existing bioinformatics tools often lack the flexibility to perform equity-driven analyses or require significant programming expertise.

OBJECTIVE: We developed AI-HOPE-PM (Artificial Intelligence Agent for High-Optimization and Precision Medicine in Population Metrics), a conversational artificial intelligence system designed to enable natural language-driven, multidimensional cancer analysis. This study describes the development, implementation, and application of AI-HOPE-PM to support hypothesis testing that integrates genomic, clinical, and SDOH data.

METHODS: AI-HOPE-PM leverages large language models and Python-based statistical scripts to convert user-defined natural language queries into executable workflows. It was evaluated using curated colorectal cancer datasets from The Cancer Genome Atlas and cBioPortal, enriched with harmonized SDOH variables. Accuracy of natural language interpretation, run time efficiency, and usability were benchmarked against cBioPortal and UCSC Xena.

RESULTS: AI-HOPE-PM successfully supported case-control stratification, survival modeling, and odds ratio analysis using natural language prompts. In colorectal cancer case studies, the system revealed significant disparities in progression-free survival and treatment access based on financial strain, health care access, food insecurity, and social support, demonstrating the importance of integrating SDOH in cancer research. Benchmark testing showed faster task execution compared to existing platforms, and the system achieved 92.5% accuracy in parsing biomedical queries.

CONCLUSIONS: AI-HOPE-PM lowers technical barriers to integrative cancer research by enabling real-time, user-friendly exploration of clinical, genomic, and SDOH data. It expands on prior work by incorporating equity metrics into precision oncology workflows and offers a scalable tool for supporting disparities-focused translational research. Five videos are included as multimedia appendices to demonstrate platform functionality in real-world scenarios.

PMID:41342165 | DOI:10.2196/76553

Categories
Nevin Manimala Statistics

Long term graft survival and rejection rate of zero-human leukocyte-antigen-mismatched deceased donor kidney transplant recipients: a retrospective multicentric cohort study

Kidney Res Clin Pract. 2025 Nov 17. doi: 10.23876/j.krcp.24.238. Online ahead of print.

ABSTRACT

BACKGROUND: Historically, human leukocyte antigen (HLA) matching has been a cornerstone of kidney transplantation (KT), with favorable outcomes. However, the survival benefit of KT with zero HLA mismatches appears to have decreased with the accumulation of transplantation experience and advancements in immunosuppressive therapies.

METHODS: This was a prospective observational cohort study based on data from the Korean Organ Transplantation Registry, including patients who underwent deceased donor KT from May 2014 to December 2022. A total of 3,350 KT patients were propensity score-matched at a 1:1 ratio and compared according to zero HLA mismatching (zero group) vs. non-zero HLA mismatching (non-zero group).

RESULTS: After matching, 276 patients in the zero group were compared to 276 patients in the non-zero group. Over a follow-up period of 38.4 ± 28.8 months, the use of immunosuppressants was similar between the two groups. Multivariable-adjusted hazard ratios of non-zero group vs. zero group were 1.63 (95% confidence interval [CI], 0.72-3.69; p = 0.24) for death censored graft failure, 1.62 (95% CI, 0.96-2.76; p = 0.07) for biopsy-proven rejection, 2.09 (95% CI, 0.87-5.00; p = 0.10) for death, 1.38 (95% CI, 1.02-1.86; p = 0.03) for posttransplant infection and 4.48 (95% CI, 1.52-13.25; p = 0.001) for antibody mediated rejection.

CONCLUSION: This study suggests that rigid adherence to HLA matching may be less critical than previously thought, particularly due to advancements in immunosuppressive therapies.

PMID:41342160 | DOI:10.23876/j.krcp.24.238

Categories
Nevin Manimala Statistics

Temporal trends in acute kidney injury-related mortality across 43 countries, 1996-2021, with projections up to 2050: a global time series analysis and modelling study

Kidney Res Clin Pract. 2025 Dec 4. doi: 10.23876/j.krcp.25.224. Online ahead of print.

ABSTRACT

BACKGROUND: Acute kidney injury (AKI) is a major global public health concern. However, a major challenge in addressing the AKI burden is the lack of global data on AKI-related mortality and its predictions, leaving significant limitations in understanding its trends over time. Therefore, we aimed to estimate AKI-related mortality rate trends and forecast future deaths.

METHODS: We evaluated the temporal trends in age-standardized AKI-related mortality from 1996 to 2021 across 43 countries using the World Health Organization Mortality Database, with future projections through to 2050. Temporal trends were assessed based on age-standardized mortality rates, and future projections up to 2050 were calculated using a predictive model that considered attributable risk factors from the Global Burden of Disease Study 2021.

RESULTS: Age-standardized AKI-related mortality rate per 1,000,000 people remained stable from 1996 to 2021 (10.47 [95% confidence interval (CI), 8.84-12.11] to 9.94 [95% CI, 8.32-11.57]). Although age-standardized mortality rates were lower in high-income countries (HICs) compared to low- and middle-income countries (LMICs), HICs exhibited a modest but statistically significant increasing trend (from 5.83 per 1,000,000 people [95% CI, 4.21-7.46] to 7.30 [95% CI, 5.66-8.95]), whereas LMICs showed a declining trend (from 19.66 [95% CI, 16.78-22.53] to 15.33 [95% CI, 12.37-18.29]). Projections indicate that mortality will rise to 11.36 per 1,000,000 population (95% CI, 10.65-12.07) by 2050, primarily attributable to population aging.

CONCLUSION: This global time-series modeling study highlights rising AKI-related mortality in HICs and/or aging populations. These findings underscore the need for targeted interventions to mitigate future AKI-related deaths.

PMID:41342156 | DOI:10.23876/j.krcp.25.224