Categories
Nevin Manimala Statistics

A proximal distance algorithm for likelihood-based sparse covariance estimation

Biometrika. 2022 Dec;109(4):1047-1066. doi: 10.1093/biomet/asac011. Epub 2022 Feb 16.

ABSTRACT

This paper addresses the task of estimating a covariance matrix under a patternless sparsity assumption. In contrast to existing approaches based on thresholding or shrinkage penalties, we propose a likelihood-based method that regularizes the distance from the covariance estimate to a symmetric sparsity set. This formulation avoids unwanted shrinkage induced by more common norm penalties, and enables optimization of the resulting nonconvex objective by solving a sequence of smooth, unconstrained subproblems. These subproblems are generated and solved via the proximal distance version of the majorization-minimization principle. The resulting algorithm executes rapidly, gracefully handles settings where the number of parameters exceeds the number of cases, yields a positive-definite solution, and enjoys desirable convergence properties. Empirically, we demonstrate that our approach outperforms competing methods across several metrics, for a suite of simulated experiments. Its merits are illustrated on international migration data and a case study on flow cytometry. Our findings suggest that the marginal and conditional dependency networks for the cell signalling data are more similar than previously concluded.

PMID:38094986 | PMC:PMC10716840 | DOI:10.1093/biomet/asac011

By Nevin Manimala

Portfolio Website for Nevin Manimala