Alignment-free sequence comparison: benefits, applications, and tools

A Zielezinski, S Vinga, J Almeida, WM Karlowski - Genome biology, 2017 - Springer
Alignment-free sequence analyses have been applied to problems ranging from whole-
genome phylogeny to the classification of protein families, identification of horizontally …

Benchmarking of alignment-free sequence comparison methods

A Zielezinski, HZ Girgis, G Bernard, CA Leimeister… - Genome biology, 2019 - Springer
Background: Alignment-free (AF) sequence comparison is attracting persistent interest driven
by data-intensive applications. Hence, many AF procedures have been proposed in recent …

What can we do with 1000 plastid genomes?

J Tonti‐Filippini, PG Nevill, K Dixon, I Small - The Plant Journal, 2017 - Wiley Online Library
The plastid genome of plants is the smallest and most gene‐rich of the three genomes in
each cell and the one generally present in the highest copy number. As a result, obtaining …

Neural distance embeddings for biological sequences

G Corso, Z Ying, M Pándy… - Advances in …, 2021 - proceedings.neurips.cc
The development of data-dependent heuristics and representations for biological
sequences that reflect their evolutionary distance is critical for large-scale biological …

Fast alignment-free sequence comparison using spaced-word frequencies

CA Leimeister, M Boden, S Horwege, S Lindner… - …, 2014 - academic.oup.com
Motivation: Alignment-free methods for sequence comparison are increasingly used for
genome analysis and phylogeny reconstruction; they circumvent various difficulties of …

Skmer: assembly-free and alignment-free sample identification using genome skims

S Sarmashghi, K Bohmann, MTP Gilbert, V Bafna… - Genome biology, 2019 - Springer
The ability to inexpensively describe taxonomic diversity is critical in this era of rapid climate
and biodiversity changes. The recent genome-skimming approach extends current …

Beyond DNA barcoding: The unrealized potential of genome skim data in sample identification

K Bohmann, S Mirarab, V Bafna, MTP Gilbert - 2020 - Wiley Online Library
Genetic tools are increasingly used to identify and discriminate between species. One key
transition in this process was the recognition of the potential of the ca 658bp fragment of the …

andi: Fast and accurate estimation of evolutionary distances between closely related genomes

B Haubold, F Klötzl, P Pfaffelhuber - Bioinformatics, 2015 - academic.oup.com
Motivation: A standard approach to classifying sets of genomes is to calculate their pairwise
distances. This is difficult for large samples. We have therefore developed an algorithm for …

APPLES: scalable distance-based phylogenetic placement with or without alignments

M Balaban, S Sarmashghi, S Mirarab - Systematic Biology, 2020 - academic.oup.com
Placing a new species on an existing phylogeny has increasing relevance to several
applications. Placement can be used to update phylogenies in a scalable fashion and can …

A novel fast vector method for genetic sequence comparison

Y Li, L He, R Lucy He, SST Yau - Scientific reports, 2017 - nature.com
With sharp increasing in biological sequences, the traditional sequence alignment methods
become unsuitable and infeasible. It motivates a surge of fast alignment-free techniques for …