
cytoKernel: robust kernel embeddings for assessing differential expression of single cell data

Bioinformatics. 2025 Jul 14:btaf399. doi: 10.1093/bioinformatics/btaf399. Online ahead of print.

ABSTRACT

MOTIVATION: High-throughput sequencing of single-cell data can be used to rigorously evaluate cell specification and to identify intricate variations between groups or conditions. Many popular existing methods for differential expression target differences in aggregate measurements (mean, median, sum) and so can detect only global differential changes.

RESULTS: We present cytoKernel, a robust method for differential expression of single-cell data that uses a kernel-based score test. cytoKernel is specifically designed to assess the differential expression of single-cell RNA sequencing and high-dimensional flow or mass cytometry data using the full probability distribution pattern. It is built on kernel embeddings of the probability distributions of the single-cell data, computed from the pairwise divergence/distance between the distributions of subjects. It can detect both changes in the aggregate and more elusive variations that are often overlooked due to the multimodal characteristics of single-cell data. We performed extensive benchmarks on both simulated and real mass cytometry and single-cell RNA sequencing data sets. The cytoKernel procedure effectively controls the False Discovery Rate (FDR) and shows favourable performance compared to existing methods, identifying more differential patterns than existing approaches. We apply cytoKernel to assess gene expression and protein marker expression differences from cell subpopulations in various publicly available single-cell RNA-seq and mass cytometry data sets.
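
To make the idea of testing on pairwise distances between per-subject distributions concrete, the following is a minimal sketch in Python using only NumPy. It is not the cytoKernel implementation or its API. All function names are hypothetical, and the specific choices are assumptions made for illustration: histogram density estimates, the Jensen-Shannon divergence, an exponential kernel with a median-based bandwidth, and a permutation p-value in place of an analytic score test.

```python
import numpy as np


def js_divergence(p, q):
    """Jensen-Shannon divergence between two discrete distributions."""
    m = 0.5 * (p + q)

    def kl(a, b):
        mask = a > 0  # terms with a == 0 contribute nothing
        return np.sum(a[mask] * np.log(a[mask] / b[mask]))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def subject_histograms(samples, bins):
    """Normalised histogram (one row per subject) on shared bin edges."""
    hists = []
    for x in samples:
        counts, _ = np.histogram(x, bins=bins)
        hists.append(counts / counts.sum())
    return np.array(hists)


def kernel_distribution_test(samples, groups, n_bins=20, n_perm=999, seed=0):
    """Illustrative kernel test for one feature (gene or marker).

    samples: list of 1-D arrays, the cell-level expression for each subject.
    groups:  0/1 group label for each subject.
    Returns the test statistic and a permutation p-value.
    """
    rng = np.random.default_rng(seed)
    groups = np.asarray(groups, dtype=float)
    n = len(samples)

    # Shared bin edges so that all subject distributions are comparable.
    pooled = np.concatenate(samples)
    bins = np.linspace(pooled.min(), pooled.max(), n_bins + 1)
    hists = subject_histograms(samples, bins)

    # Pairwise divergence between subject-level distributions.
    D = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            D[i, j] = D[j, i] = js_divergence(hists[i], hists[j])

    # Exponential kernel; bandwidth set to the median pairwise divergence
    # (an assumption of this sketch).
    sigma = np.median(D[np.triu_indices(n, k=1)])
    if sigma <= 0:
        sigma = 1.0
    K = np.exp(-D / sigma)

    # Centre the kernel matrix and the group labels.
    H = np.eye(n) - 1.0 / n
    Kc = H @ K @ H
    y = groups - groups.mean()

    # Quadratic-form statistic: large when subjects in the same group
    # have similar distributions and groups differ from each other.
    stat = float(y @ Kc @ y)

    # Permutation null: shuffle group labels across subjects.
    perm_stats = np.empty(n_perm)
    for b in range(n_perm):
        yp = rng.permutation(y)
        perm_stats[b] = yp @ Kc @ yp

    p_value = (1 + np.sum(perm_stats >= stat)) / (n_perm + 1)
    return stat, float(p_value)
```

Because the statistic compares whole distributions, a sketch like this can flag a feature whose group means coincide but whose shapes differ, for example a unimodal distribution in one group and a bimodal one in the other. In a multi-feature analysis, the resulting per-feature p-values would still need a multiple-testing correction to control the FDR.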

AVAILABILITY AND IMPLEMENTATION: The methods described in this paper are implemented in the open-source R package cytoKernel, which is freely available from Bioconductor at http://bioconductor.org/packages/cytoKernel.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

PMID:40658464 | DOI:10.1093/bioinformatics/btaf399

By Nevin Manimala
