Categories
Nevin Manimala Statistics

A Bayesian genomic selection approach incorporating prior feature ordering and population structures with application to coronary artery disease

Stat Methods Med Res. 2023 Jun 28:9622802231181231. doi: 10.1177/09622802231181231. Online ahead of print.

ABSTRACT

Coronary artery disease is one of the most common types of cardiovascular disease. Death from coronary heart disease is influenced by genetic factors in both women and men. In this article, we propose a novel Bayesian variable selection framework for the identification of important genetic variants associated with coronary artery disease disease status. Instead of treating each feature independently as in conventional Bayesian variable selection methods, we propose an innovative prior for the inclusion probabilities of genetic variants that accounts for their ordering structure. We assume that neighboring variants are more likely to be selected together as they tend to be highly correlated and have similar biological functions. Additionally, we propose to group participating subjects based on underlying population structure and fit separate regressions, so that the regression coefficients can better reflect different disease risks in different population groups. Our approach borrows strength across regression models through an innovative prior inspired by the Markov random fields. The proposed framework can improve variable selection and prediction performances as demonstrated in the simulation studies. We also apply the proposed framework to the CATHeterization GENetics data with binary Coronary artery disease disease status.

PMID:37376889 | DOI:10.1177/09622802231181231

By Nevin Manimala

Portfolio Website for Nevin Manimala