DNA sequencing data continue to progress toward longer reads with increasingly lower sequencing error rates. We focus on the critical problem of mapping, or aligning, low …
Abstract Motivation The Jaccard similarity on k-mer sets has shown to be a convenient proxy for sequence identity. By avoiding expensive base-level alignments and comparing reduced …
O Mutlu, C Firtina - 2023 60th ACM/IEEE Design Automation …, 2023 - ieeexplore.ieee.org
High-throughput sequencing (HTS) technologies have revolutionized the field of genomics, enabling rapid and cost-effective genome analysis for various applications. However, the …
T Shahroodi, M Miao, M Zahedi… - 2023 IEEE 34th …, 2023 - ieeexplore.ieee.org
The high execution time of DNA sequence alignment negatively affects many genomic studies that rely on sequence alignment results. Pre-alignment filtering was introduced as a …
G Greenberg, AN Ravi, I Shomorony - Bioinformatics, 2023 - academic.oup.com
Motivation Pairwise sequence alignment is a heavy computational burden, particularly in the context of third-generation sequencing technologies. This issue is commonly addressed by …
Seed design is important for sequence similarity search applications such as read mapping and average nucleotide identity (ANI) estimation. Although k-mers and spaced k-mers are …
S Wang, Y Jiang, L Che, RH Wang… - Nucleic Acids …, 2024 - academic.oup.com
Horizontal gene transfer (HGT) phenomena pervade the gut microbiome and significantly impact human health. Yet, no current method can accurately identify complete HGT events …
Computational complexity is a key limitation of genomic analyses. Thus, over the last 30 years, researchers have proposed numerous fast heuristic methods that provide …
Motivation Substrings of length k, commonly referred to as k-mers, play a vital role in sequence analysis. However, k-mers are limited to exact matches between sequences …