Categories
Nevin Manimala Statistics

High-dimensional covariance matrices tests for analyzing multi-tumor gene expression data

Stat Methods Med Res. 2021 Jul 7:9622802211009257. doi: 10.1177/09622802211009257. Online ahead of print.

ABSTRACT

By collecting multiple sets per subject in microarray data, gene sets analysis requires characterize intra-subject variation using gene expression profiling. For each subject, the data can be written as a matrix with the different subsets of gene expressions (e.g. multiple tumor types) indexing the rows and the genes indexing the columns. To test the assumption of intra-subject (tumor) variation, we present and perform tests of multi-set sphericity and multi-set identity of covariance structures across subjects (tumor types). We demonstrate by both theoretical and empirical studies that the tests have good properties. We applied the proposed tests on The Cancer Genome Atlas (TCGA) and tested covariance structures for the gene expressions across several tumor types.

PMID:34232835 | DOI:10.1177/09622802211009257

By Nevin Manimala

Portfolio Website for Nevin Manimala