Skip to Main Content
The importance of data quality for generating reliable distribution models for rare, elusive, and cryptic speciesAuthor(s): Keith B. Aubry; Catherine M. Raley; Kevin S. McKelvey
Source: PLoS One. 12: e0179152.
Publication Series: Scientific Journal (JRNL)
Station: Pacific Northwest Research Station
View PDF (1.0 MB)
DescriptionThe availability of spatially referenced environmental data and species occurrence records in online databases enable practitioners to easily generate species distribution models (SDMs) for a broad array of taxa. Such databases often include occurrence records of unknown reliability, yet little information is available on the influence of data quality on SDMs generated for rare, elusive, and cryptic species that are prone to misidentification in the field. We investigated this question for the fisher (Pekania pennanti), a forest carnivore of conservation concern in the Pacific States that is often confused with the more common Pacific marten (Martes caurina). Fisher occurrence records supported by physical evidence (verifiable records) were available from a limited area, whereas occurrence records of unknown quality (unscreened records) were available from throughout the fisher's historical range. We reserved 20% of the verifiable records to use as a test sample for both models and generated SDMs with each dataset using Maxent. The verifiable model performed substantially better than the unscreened model based on multiple metrics including AUCtest values (0.78 and 0.62, respectively), evaluation of training and test gains, and statistical tests of how well each model predicted test localities. In addition, the verifiable model was consistent with our knowledge of the fisher's habitat relations and potential distribution, whereas the unscreened model indicated a much broader area of high-quality habitat (indices > 0.5) that included large expanses of high-elevation habitat that fishers do not occupy. Because Pacific martens remain relatively common in upper elevation habitats in the Cascade Range and Sierra Nevada, the SDM based on unscreened records likely reflects primarily a conflation of marten and fisher habitat. Consequently, accurate identifications are far more important than the spatial extent of occurrence records for generating reliable SDMs for the fisher in this region. We strongly recommend that practitioners avoid using anecdotal occurrence records to build SDMs but, if such data are used, the validity of resulting models should be tested with verifiable occurrence records.
- You may send email to firstname.lastname@example.org to request a hard copy of this publication.
- (Please specify exactly which publication you are requesting and your mailing address.)
- We recommend that you also print this page and attach it to the printout of the article, to retain the full citation information.
- This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.
CitationAubry, Keith B.; Raley, Catherine M.; McKelvey, Kevin S. 2017. The importance of data quality for generating reliable distribution models for rare, elusive, and cryptic species. PLoS One. 12: e0179152.
Keywordsdata quality, species distribution models (SDMs), fisher, Pekania pennant, Pacific marten, Martes caurina, taxa
- Distribution and broadscale habitat relations of the wolverine in the contiguous United States
- Using occupancy and population models to assess habitat conservation opportunities for an isolated carnivore population
- Spatiotemporal variation in resource selection: Insights from the American marten (Martes Americana)
XML: View XML