Bioinformatics. 2021 May 14:btab378. doi: 10.1093/bioinformatics/btab378. Online ahead of print.
ABSTRACT
SUMMARY: The sparse allele vectors (SAV) file format is an efficient storage format for large-scale DNA variation data and is designed for high throughput association analysis by leveraging techniques for fast deserialization of data into computer memory. A command line interface has been developed to complement the storage format and supports basic features like importing, exporting and subsetting. Additionally, a C ++ programming API is available allowing for easy integration into analysis software.
AVAILABILITY AND IMPLEMENTATION: https://github.com/statgen/savvy.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
PMID:33989384 | DOI:10.1093/bioinformatics/btab378