Nonparametric intensity bounds for the delineation of spatial clusters

Int J Health Geogr. 2011 Jan 7:10:1. doi: 10.1186/1476-072X-10-1.

Abstract

Background: There is considerable uncertainty in the disease rate estimation for aggregated area maps, especially for small population areas. As a consequence the delineation of local clustering is subject to substantial variation. Consider the most likely disease cluster produced by any given method, like SaTScan, for the detection and inference of spatial clusters in a map divided into areas; if this cluster is found to be statistically significant, what could be said of the external areas adjacent to the cluster? Do we have enough information to exclude them from a health program of prevention? Do all the areas inside the cluster have the same importance from a practitioner perspective?

Results: We propose a method to measure the plausibility of each area being part of a possible localized anomaly in the map. In this work we assess the problem of finding error bounds for the delineation of spatial clusters in maps of areas with known populations and observed number of cases. A given map with the vector of real data (the number of observed cases for each area) shall be considered as just one of the possible realizations of the random variable vector with an unknown expected number of cases. The method is tested in numerical simulations and applied for three different real data maps for sharply and diffusely delineated clusters. The intensity bounds found by the method reflect the degree of geographic focus of the detected clusters.

Conclusions: Our technique is able to delineate irregularly shaped and multiple clusters, making use of simple tools like the circular scan. Intensity bounds for the delineation of spatial clusters are obtained and indicate the plausibility of each area belonging to the real cluster. This tool employs simple mathematical concepts and interpreting the intensity function is very intuitive in terms of the importance of each area in delineating the possible anomalies of the map of rates. The Monte Carlo simulation requires an effort similar to the circular scan algorithm, and therefore it is quite fast. We hope that this tool should be useful in public health decision making of which areas should be prioritized.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Brazil / epidemiology
  • Breast Neoplasms / epidemiology
  • Chagas Disease / epidemiology
  • Data Interpretation, Statistical*
  • Epidemiologic Methods*
  • Female
  • Geography
  • Homicide / statistics & numerical data
  • Humans
  • Likelihood Functions
  • Monte Carlo Method
  • Population Surveillance / methods*
  • Risk
  • Small-Area Analysis*
  • Space-Time Clustering*
  • Statistics, Nonparametric*
  • United States / epidemiology