Bioinformatics. 2026 Jan 9:btag004. doi: 10.1093/bioinformatics/btag004. Online ahead of print.
ABSTRACT
SUMMARY: Sediment DNA–the ability to extract DNA from archaeological sediments– is an exciting new frontier in ancient DNA research, offering the potential to study individuals at a given archaeological site without destructive sampling. In recent years, several studies have demonstrated the promise of this approach by recovering hominin DNA from prehistoric sediments, including those dating back to the Middle or Late Pleistocene. However, a lack of open-source workflows for analysis of hominin sediment DNA samples poses a challenge for data processing and reproducibility of findings across studies. Here we introduce a snakemake workflow, sedimix, for processing genomic sequences from archaeological sediment DNA samples to identify hominin sequences and generate relevant summary statistics to assess the reliability of the pipeline. By performing simulations and comparing our results to two published studies with human DNA from ∼25,000 years ago (including shotgun data from a sediment sample and capture data from touch DNA recovered from a deer tooth pendant) we demonstrate that sedimix yields accurate and reliable inferences. sedimix offers a reliable and adaptable framework to aid in the analysis of sediment DNA datasets and improve reproducibility across studies.
AVAILABILITY AND IMPLEMENTATION: sedimix is available as an open-source software with the associated code, example data, and user manual with installation instructions available at https://github.com/jierui-cell/sedimix.A permanent archived version of this release is available via Zenodo: https://doi.org/10.5281/zenodo.17244854.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
PMID:41512286 | DOI:10.1093/bioinformatics/btag004