Overlap graphs and de Bruijn graphs: data structures for de novo genome assembly in the big data era

R Rizzi, S Beretta, M Patterson, Y Pirola, M Previtali… - Quantitative …, 2019 - Springer
Background De novo genome assembly relies on two kinds of graphs: de Bruijn graphs and
overlap graphs. Overlap graphs are the basis for the Celera assembler, while de Bruijn …

Parallel string graph construction and transitive reduction for de novo genome assembly

G Guidi, O Selvitopi, M Ellis, L Oliker… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
One of the most computationally intensive tasks in computational biology is de novo genome
assembly, the decoding of the sequence of an unknown genome from redundant and …

Multithread multistring Burrows–Wheeler transform and longest common prefix array

P Bonizzoni, G Della Vedova, Y Pirola… - Journal of …, 2019 - liebertpub.com
Indexing huge collections of strings, such as those produced by the widespread sequencing
technologies, heavily relies on multistring generalizations of the Burrows–Wheeler transform …

Gpu-accelerated large-scale genome assembly

S Goswami, K Lee, S Shams… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Spurred by a widening gap between hardware accelerators and traditional processors,
numerous bioinformatics applications have harnessed the computing power of GPUs and …

Approximate suffix-prefix dictionary queries

W Zuba, G Loukides, SP Pissis… - … of Computer Science …, 2024 - drops.dagstuhl.de
In the all-pairs suffix-prefix (APSP) problem [Gusfield et al., Inf. Process. Lett. 1992], we are
given a dictionary R of r strings, S₁,…, S_r, of total length n, and we are asked to find the …

Can Formal Languages help Pangenomics to represent and analyze multiple genomes?

P Bonizzoni, C De Felice, Y Pirola, R Rizzi… - … on Developments in …, 2022 - Springer
Graph pangenomics is a new emerging field in computational biology that is changing the
traditional view of a reference genome from a linear sequence to a new paradigm: a …

[PDF][PDF] Suffix-prefix queries on a dictionary

G Loukides, SP Pissis, SV Thankachan… - CPM 2023-34th Annual …, 2023 - inria.hal.science
In the all-pairs suffix-prefix (APSP) problem, we are given a dictionary R of k strings, S1,...,
Sk, of total length n, and we are asked to find the length SPLi, j of the longest string that is …

On the longest common prefix of suffixes in an inverse Lyndon factorization and other properties

P Bonizzoni, C De Felice, R Zaccagnino… - Theoretical Computer …, 2021 - Elsevier
The Lyndon factorization of a word has been largely studied and recently variants of it have
been introduced and investigated with different motivations. In particular, the canonical …

An analysis of automated software diversity using unstructured text analytics

AS Gearhart, PA Hamilton… - 2018 48th Annual IEEE …, 2018 - ieeexplore.ieee.org
Automated software diversity promises to reduce an attacker's ability to reuse exploits across
application instances. However, many questions remain regarding the efficacy of and …

Lyndon words versus inverse Lyndon words: Queries on suffixes and bordered words

P Bonizzoni, C De Felice, R Zaccagnino… - … Conference on Language …, 2020 - Springer
The Lyndon factorization of a word has been extensively studied in different contexts and
several variants of it have been proposed. In particular, the canonical inverse Lyndon …