Categories
Nevin Manimala Statistics

A Fully-Integrated Bayesian Approach for the Imputation and Analysis of Derived Outcome Variables With Missingness

Stat Med. 2026 Jan;45(1-2):e70383. doi: 10.1002/sim.70383.

ABSTRACT

Derived variables are variables that are constructed from one or more source variables through established mathematical operations or algorithms. For example, body mass index (BMI) is a derived variable constructed from two source variables: weight and height. When using a derived variable as the outcome in a statistical model, complications arise when some of the source variables have missing values. In this paper, we propose how one can define a single fully integrated Bayesian model to simultaneously impute missing values and sample from the posterior. We compare our proposed method with alternative approaches that rely on multiple imputation (MI), with examples including an analysis to estimate the risk of microcephaly (a derived variable based on sex, gestational age, and head circumference at birth) in newborns exposed to the ZIKA virus.

PMID:41569594 | DOI:10.1002/sim.70383

By Nevin Manimala

Portfolio Website for Nevin Manimala