Next-generation sequencing: big data meets high performance computing

B Schmidt, A Hildebrandt - Drug discovery today, 2017 - Elsevier
Highlights•Analysis of massive NGS datasets poses difficult computational challenges.•Big
data algorithms are often adapted for NGS analysis.•HPC becomes pivotal as NGS …

GenStore: A high-performance in-storage processing system for genome sequence analysis

N Mansouri Ghiasi, J Park, H Mustafa, J Kim… - Proceedings of the 27th …, 2022 - dl.acm.org
Read mapping is a fundamental step in many genomics applications. It is used to identify
potential matches and differences between fragments (called reads) of a sequenced …

Performance optimization in DNA short-read alignment

R Wilton, AS Szalay - Bioinformatics, 2022 - academic.oup.com
Over the past decade, short-read sequence alignment has become a mature technology.
Optimized algorithms, careful software engineering and high-speed hardware have …

GenStore: A High-Performance and Energy-Efficient In-Storage Computing System for Genome Sequence Analysis

NM Ghiasi, J Park, H Mustafa, J Kim, A Olgun… - arXiv preprint arXiv …, 2022 - arxiv.org
Read mapping is a fundamental, yet computationally-expensive step in many genomics
applications. It is used to identify potential matches and differences between fragments …

Short read mapping: an algorithmic tour

S Canzar, SL Salzberg - Proceedings of the IEEE, 2015 - ieeexplore.ieee.org
Ultra-high-throughput next-generation sequencing (NGS) technology allows us to determine
the sequence of nucleotides of many millions of DNA molecules in parallel. Accompanied by …

Genpip: In-memory acceleration of genome analysis via tight integration of basecalling and read mapping

H Mao, M Alser, M Sadrosadati, C Firtina… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Nanopore sequencing is a widely-used high-throughput genome sequencing technology
that can sequence long fragments of a genome into raw electrical signals at low cost …

Enabling fast and energy-efficient FM-index exact matching using processing-near-memory

JM Herruzo, I Fernandez, S González-Navarro… - The Journal of …, 2021 - Springer
Memory bandwidth and latency constitutes a major performance bottleneck for many data-
intensive applications. While high-locality access patterns take advantage of the deep cache …

[HTML][HTML] Detection of haplotype-dependent allele-specific DNA methylation in WGBS data

J Abante, Y Fang, AP Feinberg, J Goutsias - Nature communications, 2020 - nature.com
In heterozygous genomes, allele-specific measurements can reveal biologically significant
differences in DNA methylation between homologous alleles associated with local changes …

Short-read aligner performance in germline variant identification

R Wilton, AS Szalay - Bioinformatics, 2023 - academic.oup.com
Motivation Read alignment is an essential first step in the characterization of DNA sequence
variation. The accuracy of variant-calling results depends not only on the quality of read …

Top-down performance profiling on nvidia's gpus

A Saiz, P Prieto, P Abad, JA Gregorio… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
The rise of data-intensive algorithms, such as Machine Learning ones, has meant a strong
diversification of Graphics Processing Units (GPU) in fields with intensive Data-Level …