Multiple genome alignment in the telomere-to-telomere assembly era

B Kille, A Balaji, FJ Sedlazeck, M Nute, TJ Treangen - Genome Biology, 2022 - Springer
With the arrival of telomere-to-telomere (T2T) assemblies of the human genome comes the
computational challenge of efficiently and accurately constructing multiple genome …

Opportunities and challenges of data-driven virus discovery

C Lauber, S Seitz - Biomolecules, 2022 - mdpi.com
Virus discovery has been fueled by new technologies ever since the first viruses were
discovered at the end of the 19th century. Starting with mechanical devices that provided …

Ensembl Genomes 2022: an expanding genome resource for non-vertebrates

AD Yates, J Allen, RM Amode, AG Azov… - Nucleic acids …, 2022 - academic.oup.com
Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-vertebrate
genomes and analysis complementing vertebrate resources developed by the …

Minimizer-space de Bruijn graphs: Whole-genome assembly of long reads in minutes on a personal computer

B Ekim, B Berger, R Chikhi - Cell Systems, 2021 - cell.com
DNA sequencing data continue to progress toward longer reads with increasingly lower
sequencing error rates. Here, we define an algorithmic approach, mdBG, that makes use of …

Genomic epidemiology reveals multidrug resistant plasmid spread between Vibrio cholerae lineages in Yemen

F Lassalle, S Al-Shalali, M Al-Hakimi, E Njamkepo… - Nature …, 2023 - nature.com
Since 2016, Yemen has been experiencing the largest cholera outbreak in modern history.
Multidrug resistance (MDR) emerged among Vibrio cholerae isolates from cholera patients …

Themisto: a scalable colored k-mer index for sensitive pseudoalignment against hundreds of thousands of bacterial genomes

JN Alanko, J Vuohtoniemi, T Mäklin, SJ Puglisi - Bioinformatics, 2023 - academic.oup.com
Motivation: Huge datasets containing whole-genome sequences of bacterial strains are now
commonplace and represent a rich and important resource for modern genomic …

Extremely fast construction and querying of compacted and colored de Bruijn graphs with GGCAT

A Cracco, AI Tomescu - Genome Research, 2023 - genome.cshlp.org
Compacted de Bruijn graphs are one of the most fundamental data structures in
computational genomics. Colored compacted de Bruijn graphs are a variant built on a …

Fulgor: a fast and compact k-mer index for large-scale matching and color queries

J Fan, J Khan, NP Singh, GE Pibiri, R Patro - Algorithms for Molecular …, 2024 - Springer
The problem of sequence identification or matching—determining the subset of reference
sequences from a given collection that are likely to contain a short, queried nucleotide …

Scalable, ultra-fast, and low-memory construction of compacted de Bruijn graphs with Cuttlefish 2

J Khan, M Kokot, S Deorowicz, R Patro - Genome Biology, 2022 - Springer
The de Bruijn graph is a key data structure in modern computational genomics, and
construction of its compacted variant resides upstream of many genomic analyses. As the …

Accurate and fast graph-based pangenome annotation and clustering with ggCaller

ST Horsfield, G Tonkin-Hill, NJ Croucher… - Genome …, 2023 - genome.cshlp.org
Bacterial genomes differ in both gene content and sequence mutations, which underlie
extensive phenotypic diversity, including variation in susceptibility to antimicrobials or …