Categories
Nevin Manimala Statistics

Fast Bayesian inference for large occupancy datasets

Biometrics. 2022 Dec 29. doi: 10.1111/biom.13816. Online ahead of print.

ABSTRACT

In recent years, the study of species’ occurrence has benefited from the increased availability of large-scale citizen-science data. Whilst abundance data from standardized monitoring schemes are biased towards well-studied taxa and locations, opportunistic data are available for many taxonomic groups, from a large number of locations and across long timescales. Hence, these data provide opportunities to measure species’ changes in occurrence, particularly through the use of occupancy models, which account for imperfect detection. These opportunistic datasets can be substantially large, numbering hundreds of thousands of sites, and hence present a challenge from a computational perspective, especially within a Bayesian framework. In this paper, we develop a unifying framework for Bayesian inference in occupancy models that account for both spatial and temporal autocorrelation. We make use of the Pólya-Gamma scheme, which allows for fast inference, and incorporate spatio-temporal random effects using Gaussian processes (GPs), for which we consider two efficient approximations: Subset of Regressors and Nearest neighbour GPs. We apply our model to data on two UK butterfly species, one common and widespread and one rare, using records from the Butterflies for the New Millennium database, producing occupancy indices spanning 45 years. Our framework can be applied to a wide range of taxa, providing measures of variation in species’ occurrence, which are used to assess biodiversity change. This article is protected by copyright. All rights reserved.

PMID:36579700 | DOI:10.1111/biom.13816

By Nevin Manimala

Portfolio Website for Nevin Manimala