
Assumptions about frequency-dependent architectures of complex traits bias measures of functional enrichment

Genet Epidemiol. 2021 Jun 22. doi: 10.1002/gepi.22388. Online ahead of print.

ABSTRACT

Linkage disequilibrium score regression (LDSC) is a popular framework for analyzing genome-wide association study (GWAS) summary statistics. It allows estimation of single nucleotide polymorphism heritability, confounding, and the functional enrichment of genetic variants with different annotations. Recent work has highlighted how the model's implicit and explicit assumptions influence the biological interpretation of its results. In this study, we explored a formulation of LDSC that replaces the r² measure of LD with a recently proposed unbiased estimator of the D² statistic. In addition to modest statistical differences across estimators, this derivation highlighted implicit and unrealistic assumptions about the relationship between allele frequency, effect size, and annotation status. We carried out a systematic comparison of alternative LDSC formulations by applying them to summary statistics from 47 GWAS traits. Our results show that commonly used models likely underestimate functional enrichment. These results highlight the importance of calibrating the LDSC model to achieve a more robust understanding of polygenic traits.
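
To make the distinction between the two LD measures concrete, the following is a minimal sketch that computes D, D², and r² for a pair of biallelic loci from phased 0/1 haplotypes. It uses the textbook plug-in definitions (D = p_AB - p_A p_B, and r² = D² divided by p_A(1 - p_A) p_B(1 - p_B)); it is not the unbiased D² estimator the abstract refers to, and the function name `ld_stats` and the toy haplotypes are assumptions made purely for illustration.

```python
def ld_stats(hap_a, hap_b):
    """Return (D, D^2, r^2) for two biallelic loci.

    hap_a, hap_b: equal-length sequences of 0/1 alleles, one entry per
    phased haplotype. These are naive plug-in estimates, NOT the unbiased
    D^2 estimator mentioned in the abstract.
    """
    if len(hap_a) != len(hap_b) or len(hap_a) == 0:
        raise ValueError("haplotype vectors must be non-empty and equal length")
    n = len(hap_a)
    p_a = sum(hap_a) / n                                   # allele frequency at locus A
    p_b = sum(hap_b) / n                                   # allele frequency at locus B
    p_ab = sum(a * b for a, b in zip(hap_a, hap_b)) / n    # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                                   # covariance between the two loci
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    return d, d * d, d * d / denom


# Two perfectly correlated common variants (frequency 0.5).
common = [1, 1, 0, 0, 1, 1, 0, 0, 1, 0]
d_c, d2_c, r2_c = ld_stats(common, common)

# Two perfectly correlated rare variants (frequency 0.1).
rare = [1, 0, 0, 0, 0, 0, 0, 0, 0, 0]
d_r, d2_r, r2_r = ld_stats(rare, rare)

print(f"common pair: D^2 = {d2_c:.4f}, r^2 = {r2_c:.4f}")
print(f"rare pair:   D^2 = {d2_r:.4f}, r^2 = {r2_r:.4f}")
```

In both toy pairs r² is approximately 1, because r² rescales by the allele frequencies, while D² is much smaller for the rare pair than for the common pair. This frequency dependence is one way to see why swapping the LD measure changes what a model implicitly assumes about allele frequency and effect size.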

PMID:34157784 | DOI:10.1002/gepi.22388

By Nevin Manimala
