Cluster Analysis-Based Approaches for Geospatiotemporal Data Mining of Massive Data Sets for Identification of Forest Threats

  • Authors: Mills, Richard Trans; Hoffman, Forrest M; Kumar, Jitendra; Hargrove, William W.
  • Publication Year: 2011
  • Publication Series: Scientific Journal (JRNL)
  • Source: Procedia Computer Science 4:1612-1621


We investigate methods for geospatiotemporal data mining of multi-year land surface phenology data (250 m2 Normalized Difference Vegetation Index (NDVI) values derived from the Moderate Resolution Imaging Spectrometer (MODIS) in this study) for the conterminous United States (CONUS) as part of an early warning system for detecting threats to forest ecosystems. The approaches explored here are based on k-means cluster analysis of this massive data set, which provides a basis for defining the bounds of the expected or “normal” phenological patterns that indicate healthy vegetation at a given geographic location. We briefly describe the computational approaches we have used to make cluster analysis of such massive data sets feasible, describe approaches we have explored for distinguishing between normal and abnormal phenology, and present some examples in which we have applied these approaches to identify various forest disturbances in the CONUS.

  • Citation: Mills, Richard Trans.; Hoffman, Forrest M.; Kumar, Jitendra; Hargrove, William W 2011. Cluster Analysis-Based Approaches for Geospatiotemporal Data Mining of Massive Data Sets for Identification of Forest Threats. Procedia Computer Science 4:1612-1621.
  • Keywords: phenology, MODIS, NDVI, remote sensing, k-means clustering, data mining, anomaly detection, high performance computing
  • Posted Date: August 4, 2011
  • Modified Date: August 8, 2011
  • Print Publications Are No Longer Available

    In an ongoing effort to be fiscally responsible, the Southern Research Station (SRS) will no longer produce and distribute hard copies of our publications. Many SRS publications are available at cost via the Government Printing Office (GPO). Electronic versions of publications may be downloaded, printed, and distributed.

    Publication Notes

    • This article was written and prepared by U.S. Government employees on official time, and is therefore in the public domain.
    • Our online publications are scanned and captured using Adobe Acrobat. During the capture process some typographical errors may occur. Please contact the SRS webmaster if you notice any errors which make this publication unusable.
    • To view this article, download the latest version of Adobe Acrobat Reader.