Benchmarking of alignment-free sequence comparison methods

A Zielezinski, HZ Girgis, G Bernard, CA Leimeister… - Genome biology, 2019 - Springer
Background Alignment-free (AF) sequence comparison is attracting persistent interest driven
by data-intensive applications. Hence, many AF procedures have been proposed in recent …

The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances

S Röhling, A Linne, J Schellhorn, M Hosseini… - Plos one, 2020 - journals.plos.org
We study the number N k of length-k word matches between pairs of evolutionarily related
DNA sequences, as a function of k. We show that the Jukes-Cantor distance between two …

Alignment-free sequence comparison: A systematic survey from a machine learning perspective

KS Bohnsack, M Kaden, J Abel… - IEEE/ACM Transactions …, 2022 - ieeexplore.ieee.org
The encounter of large amounts of biological sequence data generated during the last
decades and the algorithmic and hardware improvements have offered the possibility to …

The complexity landscape of viral genomes

JM Silva, D Pratas, T Caetano, S Matos - GigaScience, 2022 - academic.oup.com
Background Viruses are among the shortest yet highly abundant species that harbor
minimal instructions to infect cells, adapt, multiply, and exist. However, with the current …

Space-efficient representation of genomic k-mer count tables

Y Shibuya, D Belazzougui, G Kucherov - Algorithms for Molecular Biology, 2022 - Springer
Motivation k-mer counting is a common task in bioinformatic pipelines, with many dedicated
tools available. Many of these tools produce in output k-mer count tables containing both k …

Alignment-and reference-free phylogenomics with colored de Bruijn graphs

R Wittler - Algorithms for Molecular Biology, 2020 - Springer
Background The increasing amount of available genome sequence data enables large-
scale comparative studies. A common task is the inference of phylogenies—a challenging …

Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences

CA Leimeister, J Schellhorn, S Dörrer, M Gerth… - …, 2019 - academic.oup.com
Word-based or 'alignment-free'sequence comparison has become an active research area
in bioinformatics. While previous word-frequency approaches calculated rough measures of …

Multilocus marker-based delimitation of Salicornia persica and its population discrimination assisted by supervised machine learning approach

R Jamdade, K Al-Shaer, M Al-Sallani, E Al-Harthi… - Plos one, 2022 - journals.plos.org
The Salicornia L. has been considered one of the most taxonomically challenging genera
due to high morphological plasticity, intergradation between related species, and lack of …

Phylogenies from unaligned proteomes using sequence environments of amino acid residues

JC Aledo - Scientific reports, 2022 - nature.com
Alignment-free methods for sequence comparison and phylogeny inference have attracted a
great deal of attention in recent years. Several algorithms have been implemented in diverse …

Alignment‐free methods for polyploid genomes: quick and reliable genetic distance estimation

A VanWallendael, M Alvarez - Molecular ecology resources, 2022 - Wiley Online Library
Polyploid genomes pose several inherent challenges to population genetic analyses. While
alignment‐based methods are fundamentally limited in their applicability to polyploids …