Non-biological synthetic spike-in controls and the AMPtk software pipeline improve mycobiome data
Author(s): Jonathan M. Palmer; Michelle A. Jusino; Mark T. Banik; Daniel L. Lindner
Publication Series: Scientific Journal (JRNL)
Station: Northern Research Station
Description: High-throughput amplicon sequencing (HTAS) of conserved DNA regions is a powerful technique to characterize microbial communities. Recently, spike-in mock communities have been used to measure accuracy of sequencing platforms and data analysis pipelines. To assess the ability of sequencing platforms and data processing pipelines using fungal internal transcribed spacer (ITS) amplicons, we created two ITS spike-in control mock communities composed of cloned DNA in plasmids: a biological mock community, consisting of ITS sequences from fungal taxa, and a synthetic mock community (SynMock), consisting of non-biological ITS-like sequences. Using these spike-in controls we show that: (1) a non-biological synthetic control (e.g., SynMock) is the best solution for parameterizing bioinformatics pipelines, (2) pre-clustering steps for variable length amplicons are critically important, (3) a major source of bias is attributed to the initial polymerase chain reaction (PCR) and thus HTAS read abundances are typically not representative of starting values. We developed AMPtk, a versatile software solution equipped to deal with variable length amplicons and quality filter HTAS data based on spike-in controls. While we describe herein a non-biological SynMock community for ITS sequences, the concept and AMPtk software can be widely applied to any HTAS dataset to improve data quality.
- Check the Northern Research Station web site to request a printed copy of this publication.
- Our on-line publications are scanned and captured using Adobe Acrobat.
- During the capture process some typographical errors may occur.
- Please contact Sharon Hobrla, firstname.lastname@example.org, if you notice any errors that make this publication unusable.
- We recommend that you also print this page and attach it to the printout of the article, to retain the full citation information.
- This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.
Citation: Palmer, Jonathan M.; Jusino, Michelle A.; Banik, Mark T.; Lindner, Daniel L. 2018. Non-biological synthetic spike-in controls and the AMPtk software pipeline improve mycobiome data. PeerJ. 6(2): e4925. https://doi.org/10.7717/peerj.4925.
Keywords: Mycobiome, Synthetic spike-in control, Amplicon toolkit, Fungi, Metabarcoding, High-throughput amplicon sequencing, Biodiversity, rRNA internal transcribed spacer, Environmental sequencing