Skip to Main Content
Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus woodAuthor(s): Tuo He; Lichao Jiao; Alex C. Wiedenhoeft; Yafang Yin
Source: Planta. 249(5): 1617-1625.
Publication Series: Scientific Journal (JRNL)
Station: Forest Products Laboratory
View PDF (2.0 MB)
DescriptionDNA barcoding is a promising tool to combat illegal logging and associated trade, and the development of reliable and efcient analytical methods is essential for its extensive application in the trade of wood and in the forensics of natural materials more broadly. In this study, 120 DNA sequences of four barcodes (ITS2, matK, ndhF-rpl32, and rbcL) generated in our previous study and 85 downloaded from National Center for Biotechnology Information (NCBI) were collected to establish a reference data set for six commercial Pterocarpus woods. MLAs (BLOG, BP-neural network, SMO and J48) were compared with distance- (TaxonDNA) and tree-based (NJ tree) methods based on identifcation accuracy and cost-efectiveness across these six species, and also were applied to discriminate the CITES-listed species Pterocarpus santalinus from its anatomically similar species P. tinctorius for forensic identifcation. MLAs provided higher identifcation accuracy (30.8–100%) than distance- (15.1–97.4%) and tree-based methods (11.1–87.5%), with SMO performing the best among the machine learning classifers. The two-locus combination ITS2 + matK when using SMO classifer exhibited the highest resolution (100%) with the fewest barcodes for discriminating the six Pterocarpus species. The CITES-listed species P. santalinus was discriminated successfully from P. tinctorius using MLAs with a single barcode, ndhF-rpl32. This study shows that MLAs provided higher identifcation accuracy and cost-efectiveness for forensic application over other analytical methods in DNA barcoding of Pterocarpus wood.
- We recommend that you also print this page and attach it to the printout of the article, to retain the full citation information.
- This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.
CitationHe, Tuo; Jiao, Lichao; Wiedenhoeft, Alex C.; Yin, Yafang. 2019. Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood. Planta. 249(5): 1617-1625.
KeywordsDNA barcoding, forensic wood identification, identifcation accuracy, machine learning approaches (MLAs), Pterocarpus, SMO classifer
- DNA barcode authentication and library development for the wood of six commercial Pterocarpus species: the critical role of xylarium specimens
- Nutrient and salt relations of Pterocarpus officinalis L. in coastal wetlands of the Caribbean: assessment through leaf and soil analyses.
- Classification of CITES-listed and other neotropical Meliaceae wood images using convolutional neural networks
XML: View XML