Assembly of long, error-prone reads using repeat graphs

M Kolmogorov, J Yuan, Y Lin, PA Pevzner - Nature biotechnology, 2019 - nature.com
Accurate genome assembly is hampered by repetitive regions. Although long single
molecule sequencing reads are better able to resolve genomic repeats than short-read data …

[HTML][HTML] Linear time complexity de novo long read genome assembly with GoldRush

J Wong, L Coombe, V Nikolić, E Zhang, KM Nip… - Nature …, 2023 - nature.com
Current state-of-the-art de novo long read genome assemblers follow the Overlap-Layout-
Consensus paradigm. While read-to-read overlap–its most costly step–was improved in …

[HTML][HTML] HASLR: fast hybrid assembly of long reads

E Haghshenas, H Asghari, J Stoye, C Chauve, F Hach - Iscience, 2020 - cell.com
Third-generation sequencing technologies from companies such as Oxford Nanopore and
Pacific Biosciences have paved the way for building more contiguous and potentially gap …

Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads

A Bankevich, AV Bzikadze, M Kolmogorov… - Nature …, 2022 - nature.com
Although most existing genome assemblers are based on de Bruijn graphs, the construction
of these graphs for large genomes and large k-mer sizes has remained elusive. This …

GMcloser: closing gaps in assemblies accurately with a likelihood-based selection of contig or long-read alignments

S Kosugi, H Hirakawa, S Tabata - Bioinformatics, 2015 - academic.oup.com
Motivation: Genome assemblies generated with next-generation sequencing (NGS) reads
usually contain a number of gaps. Several tools have recently been developed to close the …

HINGE: long-read assembly achieves optimal repeat resolution

GM Kamath, I Shomorony, F Xia, TA Courtade… - Genome …, 2017 - genome.cshlp.org
Long-read sequencing technologies have the potential to produce gold-standard de novo
genome assemblies, but fully exploiting error-prone reads to resolve repeats remains a …

[HTML][HTML] Tigmint: correcting assembly errors using linked reads from large molecules

SD Jackman, L Coombe, J Chu, RL Warren… - BMC …, 2018 - Springer
Background Genome sequencing yields the sequence of many short snippets of DNA
(reads) from a genome. Genome assembly attempts to reconstruct the original genome from …

Time-and memory-efficient genome assembly with Raven

R Vaser, M Šikić - Nature Computational Science, 2021 - nature.com
Whole genome sequencing technologies are unable to invariably read DNA molecules
intact, a shortcoming that assemblers try to resolve by stitching the obtained fragments back …

Assemblathon 1: a competitive assessment of de novo short read assembly methods

D Earl, K Bradnam, JS John, A Darling, D Lin… - Genome …, 2011 - genome.cshlp.org
Low-cost short read sequencing technology has revolutionized genomics, though it is only
just becoming practical for the high-quality de novo assembly of a novel large genome. We …

Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation

S Koren, BP Walenz, K Berlin, JR Miller… - Genome …, 2017 - genome.cshlp.org
Long-read single-molecule sequencing has revolutionized de novo genome assembly and
enabled the automated reconstruction of reference-quality genomes. However, given the …