Nevin Manimala Statistics

Linear: a framework to enable existing software to resolve structural variants in long reads with flexible and efficient alignment-free statistical models

Brief Bioinform. 2023 Mar 3:bbad071. doi: 10.1093/bib/bbad071. Online ahead of print.


Alignment is the cornerstone of many long-read pipelines and plays an essential role in resolving structural variants (SVs). However, forced alignments of SVs embedded in long reads, inflexibility of integrating novel SVs models and computational inefficiency remain problems. Here, we investigate the feasibility of resolving long-read SVs with alignment-free algorithms. We ask: (1) Is it possible to resolve long-read SVs with alignment-free approaches? and (2) Does it provide an advantage over existing approaches? To this end, we implemented the framework named Linear, which can flexibly integrate alignment-free algorithms such as the generative model for long-read SV detection. Furthermore, Linear addresses the problem of compatibility of alignment-free approaches with existing software. It takes as input long reads and outputs standardized results existing software can directly process. We conducted large-scale assessments in this work and the results show that the sensitivity, and flexibility of Linear outperform alignment-based pipelines. Moreover, the computational efficiency is orders of magnitude faster.

PMID:36869850 | DOI:10.1093/bib/bbad071

By Nevin Manimala

Portfolio Website for Nevin Manimala