Towards the construction of avian chromosome assemblies

J Damas, M Farré, P Lithgow, MN Romanov… - Chromosome …, 2015 - kar.kent.ac.uk
Chromosome Research, 2015kar.kent.ac.uk
The advent of the next generation sequencing (NGS) made sequencing and scaffolding of
an entire animal genome a routine procedure. As the result we face a fast increase in the
number of animal genomes available due to the activities of large international genome
sequencing initiatives eg, Genome 10K (G10K) or smaller projects. However, the full
informative power of a sequenced genome could only be achieved when it is assembled
into chromosomes. Usually, a draft or nearly complete animal chromosome assembly is …
The advent of the next generation sequencing (NGS) made sequencing and scaffolding of an entire animal genome a routine procedure. As the result we face a fast increase in the number of animal genomes available due to the activities of large international genome sequencing initiatives e.g., Genome 10K (G10K) or smaller projects. However, the full informative power of a sequenced genome could only be achieved when it is assembled into chromosomes. Usually, a draft or nearly complete animal chromosome assembly is achieved through three steps: (i) constructing contigs based on read overlaps, (ii) merging contigs into scaffolds using pair-end reads, and (iii) mapping scaffolds on chromosomes with the use of physical or genetic maps. As the cost of mapping techniques is still much higher than sequencing, the genetic and physical maps are not available for the majority of the de novo sequenced genomes. To overcome this problem for assemblies that employ long-insert libraries (5 – 40 Kbp) we recently developed the reference-assisted chromosome assembly (RACA) algorithm (Kim et al., 2013). This method relies on both the raw sequencing data (reads) and comparative information; the latter is obtained from alignments between the target (de novo sequenced), a closely related (reference) and more distantly related (outgroup) genomes. Using RACA followed by the manual FISH or PCR verification steps we are reconstructing the chromosome organisation of 19 bird species sequenced by the G10K community. We use the publically available chicken (Gallus gallus) and zebra finch (Taeniopygia guttata) chromosome assemblies as either reference or outgroup for each reconstruction depending on their phylogenetic relationships with each target species. Initially, we established the optimal RACA parameters for a bird chromosome assembly reconstruction using the duck (Anas platyrhynchos) and budgerigar (Melopsittacus undulatus) super-scaffolds assembled with the support from physical maps. This step allowed us to test the reliability of RACA reconstructions for bird genomes. Due to a higher evolutionary conservation of the bird karyotype compared to the mammalian one, we have achieved ~97% accuracy of scaffold adjacencies in our predicted chromosome fragments compared to the ~93-96% accuracies reported for mammals (Kim et al., 2013). We detected ~4-28% of scaffolds in different target bird genomes that are either chimeric or containing genuine lineage-specific evolutionary breakpoint regions. Some of these scaffolds will be selected for follow up PCR or FISH verifications. All RACA reconstructions will become publicly available from our Evolution Highway comparative chromosome browser http://evolutionhighway.ncsa.uiuc.edu/birds/ and will be further utilised to study connections between the chromosome evolution, adaptation and phenotypic diversity in birds and other vertebrates.
kar.kent.ac.uk
以上显示的是最相近的搜索结果。 查看全部搜索结果