Motivation: The Burrows–Wheeler Transform (BWT) is a common component in full-text indices. Initially developed for data compression, it is particularly powerful for encoding …
Comprehensive collections approaching millions of sequenced genomes have become central information sources in the life sciences. However, the rapid growth of these …
This paper presents a new data structure, GIN-TONIC (Graph INdexing Through Optimal Near Interval Compaction), designed to index arbitrary string-labelled directed graphs …
Compressed full-text indexes enable efficient sequence classification against a pangenome or tree-of-life index. Past work on compressed-index classification used matching statistics …
Aligning genomes into common coordinates is central to pangenome analysis and construction, but it is also computationally expensive. Multi-sequence maximal unique …
Pangenomes are becoming a powerful framework to perform many bioinformatics analyses taking into account the genetic variability of a population, thus reducing the bias introduced …
arXiv:2407.18956v1 [cs.DS] 13 Jul 2024. MIOV: Reordering MOVI for even better locality. Peter Perešíni, Nathaniel K. Brown, Travis Gagie, and Ben Langmead …
As next-generation sequencing technologies produce deeper genome coverages at lower costs, there is a critical need for reliable computational host DNA removal in metagenomic …
A reference genome serves an important function for various genomic analyses; it acts as a template to be used to match sequencing reads to the genome and provides a coordinate …