Skip to Main Content
U.S. Forest Service
Caring for the land and serving people

United States Department of Agriculture

Home > Search > Publication Information

  1. Share via EmailShare on FacebookShare on LinkedInShare on Twitter
    Dislike this pubLike this pub
    Author(s): Bianca N. I. Eskelson; Hailemariam Temesgen; Valerie Lemay; Tara M. BarrettNicholas L. CrookstonAndrew T. Hudak
    Date: 2009
    Source: Scandinavian Journal of Forest Research. 24: 235-246.
    Publication Series: Scientific Journal (JRNL)
    Station: Rocky Mountain Research Station
    PDF: View PDF  (110.63 KB)

    Description

    Almost universally, forest inventory and monitoring databases are incomplete, ranging from missing data for only a few records and a few variables, common for small land areas, to missing data for many observations and many variables, common for large land areas. For a wide variety of applications, nearest neighbor (NN) imputation methods have been developed to fill in observations of variables that are missing on some records (Y-variables), using related variables that are available for all records (X-variables). This review attempts to summarize the advantages and weaknesses of NN imputation methods and to give an overview of the NN approaches that have most commonly been used. It also discusses some of the challenges of NN imputation methods. The inclusion of NN imputation methods into standard software packages and the use of consistent notation may improve further development of NN imputation methods. Using X-variables from different data sources provides promising results, but raises the issue of spatial and temporal registration errors. Quantitative measures of the contribution of individual X-variables to the accuracy of imputing the Y-variables are needed. In addition, further research is warranted to verify statistical properties, modify methods to improve statistical properties, and provide variance estimators.

    Publication Notes

    • You may send email to rmrspubrequest@fs.fed.us to request a hard copy of this publication.
    • (Please specify exactly which publication you are requesting and your mailing address.)
    • We recommend that you also print this page and attach it to the printout of the article, to retain the full citation information.
    • This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.

    Citation

    Eskelson, Bianca N. I.; Temesgen, Hailemariam; Lemay, Valerie; Barrett, Tara M.; Crookston, Nicholas L.; Hudak, Andrew T. 2009. The roles of nearest neighbor methods in imputing missing data in forest inventory and monitoring databases. Scandinavian Journal of Forest Research. 24: 235-246.

    Keywords

    consistent notation, forest measurements, input data for forest planning, nearest neighbor imputation, registration error, sources of X-variables

    Related Search


    XML: View XML
Show More
Show Fewer
Jump to Top of Page
https://www.fs.usda.gov/treesearch/pubs/33623