Abstract
Cluster analysis continues to be an important exploratory technique in scientific inquiry. It is used widely in geography, public health, criminology, ecology, and many other fields. Spatial cluster detection is driven by geographic information corresponding to the location of activities, requiring appropriate and meaningful treatment of space and spatial relationships combined with observed attributes of location and events. To date, this has meant utilizing dedicated measures and techniques to structure and account for distance, neighbors, contiguity, irregular geographic morphology, and so on. Unfortunately, all spatial clustering approaches, regardless of their theoretical underpinning, statistical foundation, or mathematical specification, have limitations in accuracy, sensitivity, and the computational effort required for identifying clusters. As a result, a major challenge in practice is determining which technique(s) will provide the most meaningful insights for a particular substantive issue or planning context. The purpose of this article is to provide an overview and evaluation of spatial clustering techniques, identifying the strengths and weaknesses of the most widely applied approaches. Results suggest that performance varies significantly in terms of accuracy, sensitivity, and computational expense. This is noteworthy because the misidentification of clusters, whether false positives or false negatives, has the potential to bias not only hypothesis formulation but also pragmatic facets of policy, process, and planning efforts within a region.
Original language | English (US) |
---|---|
Pages (from-to) | 1134-1156 |
Number of pages | 23 |
Journal | Annals of the Association of American Geographers |
Volume | 104 |
Issue number | 6 |
DOIs | |
State | Published - Dec 1 2014 |
Externally published | Yes |
Keywords
- cluster analysis
- hot spots
- knowledge discovery
- method selection
- scale
ASJC Scopus subject areas
- Geography, Planning and Development
- Earth-Surface Processes