External memory BWT and LCP computation for sequence collections with applications

L Egidi, FA Louza, G Manzini, GP Telles - Algorithms for Molecular Biology, 2019 - Springer
Background: Sequencing technologies produce larger and larger collections of
biosequences that have to be stored in compressed indices supporting fast search …

Multithread multistring Burrows–Wheeler transform and longest common prefix array

P Bonizzoni, G Della Vedova, Y Pirola… - Journal of …, 2019 - liebertpub.com
Indexing huge collections of strings, such as those produced by the widespread sequencing
technologies, heavily relies on multistring generalizations of the Burrows–Wheeler transform …

Space-efficient computation of the LCP array from the Burrows-Wheeler transform

N Prezza, G Rosone - arXiv preprint arXiv:1901.05226, 2019 - arxiv.org
We show that the Longest Common Prefix Array of a text collection of total size n on
alphabet [1, σ] can be computed from the Burrows-Wheeler transformed collection in …

Computing the multi-string BWT and LCP array in external memory

P Bonizzoni, G Della Vedova, Y Pirola… - Theoretical Computer …, 2021 - Elsevier
Indexing very large collections of strings, such as those produced by the widespread next
generation sequencing technologies, heavily relies on multi-string generalization of the …

Metagenomic analysis through the extended Burrows-Wheeler transform

V Guerrini, FA Louza, G Rosone - BMC bioinformatics, 2020 - Springer
Background: The development of Next Generation Sequencing (NGS) has had a
major impact on the study of genetic sequences. Among problems that researchers in the …

Space-efficient construction of compressed suffix trees

N Prezza, G Rosone - Theoretical Computer Science, 2021 - Elsevier
We show how to build several data structures of central importance to string processing by
taking as input the Burrows-Wheeler transform (BWT) and using small extra working space …

Lightweight metagenomic classification via eBWT

V Guerrini, G Rosone - International Conference on Algorithms for …, 2019 - Springer
The development of Next Generation Sequencing has had a major impact on the
study of genetic sequences, and in particular, on the advancement of metagenomics, whose …

Computing the BWT and LCP array of a set of strings in external memory

P Bonizzoni, G Della Vedova, Y Pirola… - arXiv preprint arXiv …, 2017 - arxiv.org
Indexing very large collections of strings, such as those produced by the widespread next
generation sequencing technologies, heavily relies on multistring generalization of the …