J Biopharm Stat. 2025 Apr 20:1-13. doi: 10.1080/10543406.2025.2490725. Online ahead of print.
ABSTRACT
This paper introduces the novel methodology of differential projection pursuit and its applications to the analysis of large datasets. The method was applied to a cell flow cytometry dataset as an alternative approach to analyze this type of data. Multicolor cell flow cytometry is a well-established laboratory technique to identify cell subpopulations by measuring their physical and biochemical characteristics. Differential projection pursuit helps to find regions with maximal differences between two or more treatments or distributions. Data analysis in flow cytometry relies on gating, the process of manually selecting successive subpopulations of cells using two-dimensional plots. Plotting the variables only two at a time could mask the hidden structure present in the data, and manual selection makes the analysis inconsistent and arbitrary. The new methodology could automate flow cytometry analysis by utilizing the combination of projection pursuit, data nuggets, and factor analysis. When applied to flow cytometry data, differential projection pursuit allows researchers to quickly identify differences in cell populations exposed to different experimental conditions. This methodology could create a platform to explore differences in large datasets and improve the cell flow cytometry analysis clarity and reproducibility by considering the data in its true dimensional space and through automation, respectively.
PMID:40253621 | DOI:10.1080/10543406.2025.2490725