Skip to Main Content
U.S. Forest Service
Caring for the land and serving people

United States Department of Agriculture

Home > Search > Publication Information

  1. Share via EmailShare on FacebookShare on LinkedInShare on Twitter
    Dislike this pubLike this pub
    Author(s): M. Parks; A. Liston; R. Cronn
    Date: 2010
    Source: Acta Horticulturae. 859: 323-332
    Publication Series: Scientific Journal (JRNL)
    Station: Pacific Northwest Research Station
    PDF: View PDF  (2.48 MB)

    Description

    Massively parallel sequencing technologies (MPST) offer unprecedented opportunities for novel sequencing projects. MPST, while offering tremendous sequencing capacity, are typically most effective in resequencing projects (as opposed to the sequencing of novel genomes) due to the fact that sequence is returned in relatively short reads. Nonetheless, there is great interest in applying MPST to genome sequencing in non-model organisms. We have developed a bioinformatics pipeline to assemble short-read sequence data into nearly complete chloroplast genomes using a combination of de novo and reference-guided assembly, while decreasing reliance on a reference genome. Initially, short-read sequences are assembled into larger contigs using de novo assembly. De novo contigs are then aligned to the corresponding reference genome of the most closely related taxon available and merged to form a consensus sequence. The consensus sequence and reference are in turn 'merged' such that aligned de novo sequence remains unaffected while missing sequence is filled in using the reference sequence. This chimeric reference is then utilized in reference-guided assembly to align the original short-data, resulting in a draft plastome. Using two established Pinus reference plastomes, our method has been effective in the assembly of 33 chloroplast genomes within the genus Pinus, and results with four species representing other genera of Pinaceae suggest the method will be of general use in land plants, particularly once limitations of PCR-based chloroplast enrichment are overcome.

    Publication Notes

    • You may send email to pnw_pnwpubs@fs.fed.us to request a hard copy of this publication.
    • (Please specify exactly which publication you are requesting and your mailing address.)
    • We recommend that you also print this page and attach it to the printout of the article, to retain the full citation information.
    • This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.

    Citation

    Parks, M.; Liston, A.; Cronn, R. 2010. Meeting the challenges of non-referenced genome assembly from short-read sequence data. Acta Horticulturae. 859: 323-332.

    Keywords

    next-generation sequencing, massively parallel sequencing, Pinus, Illumina

    Related Search


    XML: View XML
Show More
Show Fewer
Jump to Top of Page