[HTML][HTML] A survey of mapping algorithms in the long-reads era

K Sahlin, T Baudeau, B Cazaux, C Marchet - Genome Biology, 2023 - Springer
It has been over a decade since the first publication of a method dedicated entirely to
mapping long-reads. The distinctive characteristics of long reads resulted in methods …

Efficient mapping of accurate long reads in minimizer space with mapquik

B Ekim, K Sahlin, P Medvedev, B Berger… - Genome …, 2023 - genome.cshlp.org
DNA sequencing data continue to progress toward longer reads with increasingly lower
sequencing error rates. We focus on the critical problem of mapping, or aligning, low …

Minmers are a generalization of minimizers that enable unbiased local Jaccard estimation

B Kille, E Garrison, TJ Treangen, AM Phillippy - Bioinformatics, 2023 - academic.oup.com
Abstract Motivation The Jaccard similarity on k-mer sets has shown to be a convenient proxy
for sequence identity. By avoiding expensive base-level alignments and comparing reduced …

Accelerating genome analysis via algorithm-architecture co-design

O Mutlu, C Firtina - 2023 60th ACM/IEEE Design Automation …, 2023 - ieeexplore.ieee.org
High-throughput sequencing (HTS) technologies have revolutionized the field of genomics,
enabling rapid and cost-effective genome analysis for various applications. However, the …

SieveMem: a computation-in-memory architecture for fast and accurate pre-alignment

T Shahroodi, M Miao, M Zahedi… - 2023 IEEE 34th …, 2023 - ieeexplore.ieee.org
The high execution time of DNA sequence alignment negatively affects many genomic
studies that rely on sequence alignment results. Pre-alignment filtering was introduced as a …

LexicHash: sequence similarity estimation via lexicographic comparison of hashes

G Greenberg, AN Ravi, I Shomorony - Bioinformatics, 2023 - academic.oup.com
Motivation Pairwise sequence alignment is a heavy computational burden, particularly in the
context of third-generation sequencing technologies. This issue is commonly addressed by …

Entropy predicts sensitivity of pseudorandom seeds

BD Maier, K Sahlin - Genome Research, 2023 - genome.cshlp.org
Seed design is important for sequence similarity search applications such as read mapping
and average nucleotide identity (ANI) estimation. Although k-mers and spaced k-mers are …

[HTML][HTML] Enhancing insights into diseases through horizontal gene transfer event detection from gut microbiome

S Wang, Y Jiang, L Che, RH Wang… - Nucleic Acids …, 2024 - academic.oup.com
Horizontal gene transfer (HGT) phenomena pervade the gut microbiome and significantly
impact human health. Yet, no current method can accurately identify complete HGT events …

SequenceLab: A Comprehensive Benchmark of Computational Methods for Comparing Genomic Sequences

MD Rumpf, M Alser, AE Gollwitzer, J Lindegger… - arXiv preprint arXiv …, 2023 - arxiv.org
Computational complexity is a key limitation of genomic analyses. Thus, over the last 30
years, researchers have proposed numerous fast heuristic methods that provide …

Designing efficient randstrobes for sequence similarity analyses

M Karami, A Soltani Mohammadi, M Martin… - …, 2024 - academic.oup.com
Motivation Substrings of length k, commonly referred to as k-mers, play a vital role in
sequence analysis. However, k-mers are limited to exact matches between sequences …