Stat Methods Med Res. 2021 Jul 7:9622802211009257. doi: 10.1177/09622802211009257. Online ahead of print.
ABSTRACT
By collecting multiple sets per subject in microarray data, gene sets analysis requires characterize intra-subject variation using gene expression profiling. For each subject, the data can be written as a matrix with the different subsets of gene expressions (e.g. multiple tumor types) indexing the rows and the genes indexing the columns. To test the assumption of intra-subject (tumor) variation, we present and perform tests of multi-set sphericity and multi-set identity of covariance structures across subjects (tumor types). We demonstrate by both theoretical and empirical studies that the tests have good properties. We applied the proposed tests on The Cancer Genome Atlas (TCGA) and tested covariance structures for the gene expressions across several tumor types.
PMID:34232835 | DOI:10.1177/09622802211009257