Categories
Nevin Manimala Statistics

LMdist: Local Manifold distance accurately measures beta diversity in ecological gradients

Bioinformatics. 2023 Dec 7:btad727. doi: 10.1093/bioinformatics/btad727. Online ahead of print.

ABSTRACT

MOTIVATION: Differentiating ecosystems poses a complex, high-dimensional problem constrained by capturing relevant variation across species profiles. Researchers use pairwise distances and subsequent dimensionality reduction to highlight variation in a few dimensions. Despite popularity in analysis of ecological data, these low-dimensional visualizations can contain geometric abnormalities such as “arch” and “horseshoe” effects, potentially obscuring the impact of environmental gradients. These abnormalities appear in ordination but are in fact a product of oversaturated large pairwise distances.

RESULTS: We present Local Manifold distance (LMdist), an unsupervised algorithm which adjusts pairwise beta diversity measures to better represent true ecological distances between samples. Beta diversity measures can have a bounded dynamic range in depicting long environmental gradients with high species turnover. Using a graph structure, LMdist projects pairwise distances onto a manifold and traverses the manifold surface to adjust pairwise distances at the upper end of the beta diversity measure’s dynamic range. This allows for values beyond the range of the original measure. Not all datasets will have oversaturated pairwise distances, nor will capture variation that resembles a manifold, so LMdist adjusts only those pairwise values which may be undervalued in the presence of a sampled gradient. The adjusted distances serve as input for ordination and statistical testing. We demonstrate on real and simulated data that LMdist effectively recovers distances along known gradients and along complex manifolds such as the Swiss roll dataset. LMdist enables more powerful statistical tests for gradient effects and reveals variation orthogonal to the gradient.

AVAILABILITY: Available on GitHub at https://github.com/knights-lab/LMdist.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

PMID:38060267 | DOI:10.1093/bioinformatics/btad727

By Nevin Manimala

Portfolio Website for Nevin Manimala