CPSM: An R Package for Cancer Patient Survival Risk Model Using Transcriptomics and Clinical Data

Gigascience. 2026 Jun 2:giag067. doi: 10.1093/gigascience/giag067. Online ahead of print.

ABSTRACT

Traditional Kaplan-Meier curves capture aggregate survival trends within broad patient subgroups but overlook the heterogeneity of individual patients. In contrast, single-patient survival risk models bridge this gap by incorporating each patient’s unique clinical, genomic, and demographic characteristics, generating personalized survival curves. These individualized visualizations enhance patient-clinician communication by translating complex statistics into intuitive, time-based visuals that are easier to interpret. However, the complexity, high dimensionality, and heterogeneity of multi-omics data present significant challenges for analysis, interpretation, and model development. To address these challenges, we introduce the Cancer Patient Survival Model (CPSM), an R package designed to deliver individualized survival and risk predictions through a fully integrated, reproducible computational pipeline. CPSM includes 10 core functions organized into four key steps: (1) Data Preprocessing and Normalization, (2) Feature Selection, (3) Survival Risk-Group Prediction Modeling, and (4) Visualization and Nomogram Construction. We demonstrate the utility of CPSM using publicly available TCGA datasets for four cancer types: glioblastoma multiforme (GBM), acute myeloid leukemia (LAML), pancreatic adenocarcinoma (PAAD), and breast invasive carcinoma (BRCA). CPSM efficiently handles high-dimensional datasets with over 60,000 RNA transcripts and diverse clinical variables, enabling interpretable individualized survival predictions under varying data conditions. Model performance was evaluated using repeated cross-validation with uncertainty quantification to obtain reliable estimates in high-dimensional, small-sample settings. In summary, CPSM provides an efficient, user-friendly, end-to-end solution for integrating patient data and generating personalized survival and risk predictions. Its integrated visual tools enhance interpretability and support more informed clinical decision-making. The package is freely available on Bioconductor (https://bioconductor.org/packages/devel/bioc/html/CPSM.html) and GitHub (https://github.com/hks5august/CPSM).
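
For readers unfamiliar with the aggregate Kaplan-Meier curve that the abstract contrasts with individualized models, the following is a minimal Python sketch of the product-limit estimator. It is not CPSM code and does not use the CPSM API; the function name `kaplan_meier` and the toy follow-up data are assumptions made purely for illustration.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Product-limit survival estimate for one group of patients.

    times  -- follow-up duration for each patient
    events -- 1 if the death was observed, 0 if the patient was censored
    Returns a list of (time, survival probability) at each observed death time.
    """
    deaths = Counter(t for t, e in zip(times, events) if e)
    leaving = Counter(times)   # every patient leaves the risk set at their own time
    at_risk = len(times)
    surv = 1.0
    curve = []
    for t in sorted(leaving):
        d = deaths.get(t, 0)
        if d:
            # Multiply by the conditional probability of surviving past t.
            surv *= 1.0 - d / at_risk
            curve.append((t, surv))
        # Deaths and censored patients at t are both removed from the risk set.
        at_risk -= leaving[t]
    return curve


# Hypothetical toy cohort (months of follow-up); not taken from TCGA.
times = [5, 8, 8, 12, 15, 20]
events = [1, 1, 0, 1, 0, 1]
curve = kaplan_meier(times, events)
for t, s in curve:
    print(f"t={t:>2}  S(t)={s:.3f}")
```

The result is a single step function shared by every patient in the group, which is the limitation the abstract describes: a covariate-based model would instead produce a separate curve for each patient.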

PMID:42233233 | DOI:10.1093/gigascience/giag067

By Nevin Manimala
