Towards population-scale long-read sequencing

W De Coster, MH Weissensteiner… - Nature Reviews …, 2021 - nature.com
Long-read sequencing technologies have now reached a level of accuracy and yield that
allows their application to variant detection at a scale of tens to thousands of samples …

Variant calling and benchmarking in an era of complete human genome sequences

ND Olson, J Wagner, N Dwarshuis, KH Miga… - Nature Reviews …, 2023 - nature.com
Genetic variant calling from DNA sequencing has enabled understanding of germline
variation in hundreds of thousands of humans. Sequencing technologies and variant-calling …

A draft human pangenome reference

WW Liao, M Asri, J Ebler, D Doerr, M Haukness… - Nature, 2023 - nature.com
Abstract Here the Human Pangenome Reference Consortium presents a first draft of the
human pangenome reference. The pangenome contains 47 phased, diploid assemblies …

High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios

M Byrska-Bishop, US Evani, X Zhao, AO Basile… - Cell, 2022 - cell.com
Summary The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-
genome sequencing (WGS) data consented for public distribution without access or use …

Pangenomics enables genotyping of known structural variants in 5202 diverse genomes

J Sirén, J Monlong, X Chang, AM Novak, JM Eizenga… - Science, 2021 - science.org
INTRODUCTION Modern genomics depends on inexpensive short-read sequencing.
Sequenced reads up to a few hundred base pairs in length are computationally mapped to …

Haplotype-aware variant calling with PEPPER-Margin-DeepVariant enables high accuracy in nanopore long-reads

K Shafin, T Pesout, PC Chang, M Nattestad… - Nature …, 2021 - nature.com
Long-read sequencing has the potential to transform variant detection by reaching currently
difficult-to-map regions and routinely linking together adjacent variations to enable read …

Semi-automated assembly of high-quality diploid human reference genomes

ED Jarvis, G Formenti, A Rhie, A Guarracino, C Yang… - Nature, 2022 - nature.com
The current human reference genome, GRCh38, represents over 20 years of effort to
generate a high-quality assembly, which has benefitted society,. However, it still has many …

DeepConsensus improves the accuracy of sequences with a gap-aware sequence transformer

G Baid, DE Cook, K Shafin, T Yun… - Nature …, 2023 - nature.com
Circular consensus sequencing with Pacific Biosciences (PacBio) technology generates
long (10–25 kilobases), accurate 'HiFi'reads by combining serial observations of a DNA …

Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation

M Kolmogorov, KJ Billingsley, M Mastoras… - Nature …, 2023 - nature.com
Long-read sequencing technologies substantially overcome the limitations of short-reads
but have not been considered as a feasible replacement for population-scale projects, being …

Symphonizing pileup and full-alignment for deep learning-based long-read variant calling

Z Zheng, S Li, J Su, AWS Leung, TW Lam… - Nature Computational …, 2022 - nature.com
Deep learning-based variant callers are becoming the standard and have achieved superior
single nucleotide polymorphisms calling performance using long reads. Here we present …