Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact
representation based on short words (k-mers) that is ideal for high coverage, very short read
(25–50 bp) data sets. Applying Velvet to very short reads and paired-ends information only,
one can produce contigs of significant length, up to 50-kb N50 length in simulations of
prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa …