Metagraph: Indexing and analysing nucleotide archives at petabase-scale

M Karasikov, H Mustafa, D Danciu, C Barber… - BioRxiv, 2020 - biorxiv.org
The amount of biological sequencing data available in public repositories is growing
exponentially, forming an invaluable biomedical research resource. Yet, making all this …

Applications of de Bruijn graphs in microbiome research

K Dufault‐Thompson, X Jiang - Imeta, 2022 - Wiley Online Library
High‐throughput sequencing has become an increasingly central component of microbiome
research. The development of de Bruijn graph‐based methods for assembling high …

Buffering updates enables efficient dynamic de Bruijn graphs

J Alanko, B Alipanahi, J Settle, C Boucher… - Computational and …, 2021 - Elsevier
Motivation: The de Bruijn graph has become a ubiquitous graph model for
biological data ever since its initial introduction in the late 1990s. It has been used for a …

Graphite: painting genomes using a colored de Bruijn graph

R Beeloo, AL Zomer, S Deorowicz… - NAR Genomics and …, 2024 - academic.oup.com
The recent growth of microbial sequence data allows comparisons at unprecedented scales,
enabling the tracking of strains, mobile genetic elements, or genes. Querying a genome …

Advances in practical k-mer sets: essentials for the curious

C Marchet - arXiv preprint arXiv:2409.05210, 2024 - arxiv.org
This paper provides a comprehensive survey of data structures for representing k-mer sets,
which are fundamental in high-throughput sequencing analysis. It categorizes the methods …

Scalable Annotated Genome Graphs for Representing Sequence Data

M Karasikov - 2023 - research-collection.ethz.ch
Technological advances made over the last decades in sequencing technologies have led
to continuous improvements of quality and ever-decreasing costs of sequencing. All this …

Practical Implementations of Compressed RAM

S Jo, W Park, K Sadakane… - 2023 Data Compression …, 2023 - ieeexplore.ieee.org
Given a string S over an alphabet of size σ, we consider practical implementations of
extended compressed RAM on S, which supports access, replace, insert, and delete …

Algorithms for efficient sensitive search and sample comparison on petabase-scale genomics data

H Mustafa - 2022 - research-collection.ethz.ch
Ever-decreasing genome sequencing costs have led to an explosion in sequencing
throughput, with the global sequencing capacity expected to exceed one exabyte per year in …

Computational Methods for the Analysis of Mitochondrial Genomes

L Fiedler - 2024 - ul.qucosa.de
Much of our understanding of eukaryotic life has come from studying
mitochondrial DNA, giving rise to leading hypotheses in evolution. To enable these studies …

Variants and Applications of Colored De Bruijn Graphs

B Alipanahi - 2020 - search.proquest.com
Colored de Bruijn graphs, extensions of the de Bruijn graphs, are fundamental data
structures for the analysis of high-throughput sequencing data. Although colored de Bruijn …