medRxiv [Preprint]. 2024 Dec 12:2024.12.11.24318870. doi: 10.1101/2024.12.11.24318870.
ABSTRACT
Complex disease genetics is a key area of research for reducing disease and improving human health. Genome-wide association studies (GWAS) help in this research by identifying regions of the genome that contribute to complex disease risk. However, GWAS are computationally intensive and require access to individual-level genetic and health information, which presents concerns about privacy and imposes costs on researchers seeking to study complex diseases. Publicly released pan-biobank GWAS summary statistics provide immediate access to results for a subset of phenotypes, but they do not inform about all phenotypes or hand-crafted phenotype definitions, which are often more relevant to study. Here, we present WebGWAS, a new tool that allows researchers to obtain GWAS summary statistics for a phenotype of interest without needing access to individual-level genetic and phenotypic data. Our public web app can be used to study custom phenotype definitions, including inclusion and exclusion criteria, and to produce approximate GWAS summary statistics for that phenotype. WebGWAS computes approximate GWAS summary statistics very quickly ( < 10 seconds), and it does not store private health information. We also show how the statistical approximation underlying WebGWAS can be used to accelerate the computation of multi-phenotype GWAS among correlated phenotypes. Our tool provides a faster approach to GWAS for researchers interested in complex disease, providing approximate summary statistics in short order, without the need to collect, process, and produce GWAS results. Overall, this method advances complex disease research by facilitating more accessible and cost-effective genetic studies using large observational data.
PMID:39711729 | PMC:PMC11661389 | DOI:10.1101/2024.12.11.24318870