Privacy protection versus cluster detection in spatial epidemiology

Am J Public Health. 2006 Nov;96(11):2002-8. doi: 10.2105/AJPH.2005.069526. Epub 2006 Oct 3.

Abstract

Objectives: Patient data that include precise locations can reveal patients' identities, whereas data aggregated into administrative regions may preserve privacy and confidentiality. We investigated the effect of varying degrees of address precision (exact latitude and longitude vs the center points of zip codes or census tracts) on the detection of spatial clusters of cases.
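To make the two levels of address precision concrete, the following minimal sketch replaces each visit's exact coordinates with the center point of its region. The region labels, coordinates, and the function name `aggregate_to_centers` are hypothetical illustrations and do not come from the study.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]  # (latitude, longitude)


def aggregate_to_centers(
    visits: List[Tuple[Point, str]], centers: Dict[str, Point]
) -> List[Point]:
    """Replace each visit's exact location with the center point of its region."""
    return [centers[region] for _exact_location, region in visits]


# Hypothetical region center points and visits (not study data).
centers = {"A": (42.34, -71.10), "B": (42.35, -71.08)}
visits = [
    ((42.3412, -71.1003), "A"),
    ((42.3398, -71.0987), "A"),
    ((42.3511, -71.0790), "B"),
]

masked = aggregate_to_centers(visits, centers)
# Visits in the same region become indistinguishable after aggregation.
distinct_locations = len(set(masked))
```

After aggregation, the two visits in region "A" share a single location, which is what protects privacy and also what removes spatial detail from the analysis.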

Methods: We simulated disease outbreaks by adding supplementary spatially clustered emergency department visits to authentic hospital emergency department syndromic surveillance data. We identified clusters with a spatial scan statistic and evaluated detection rate and accuracy.
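The simulation and detection steps described above might be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: it assumes planar coordinates in kilometers, Gaussian scatter for the injected points, circular windows centered on case locations, and the log likelihood ratio from Kulldorff's Poisson-model scan statistic. The abstract does not say which variant of the scan statistic was used, and all function names and parameter values here are hypothetical.

```python
import math
import random
from typing import List, Optional, Sequence, Tuple

Point = Tuple[float, float]  # planar (x_km, y_km); an assumption for simplicity


def inject_cluster(baseline: List[Point], center: Point, n_extra: int,
                   spread: float, rng: random.Random):
    """Add n_extra simulated visits scattered (Gaussian) around center."""
    extra = [(rng.gauss(center[0], spread), rng.gauss(center[1], spread))
             for _ in range(n_extra)]
    return baseline + extra, extra


def poisson_llr(c: float, e: float, total: float) -> float:
    """Log likelihood ratio for a window with c observed and e expected cases."""
    if e <= 0 or c <= e:
        return 0.0  # only windows with an excess of cases are of interest
    inside = c * math.log(c / e)
    outside = (total - c) * math.log((total - c) / (total - e)) if c < total else 0.0
    return inside + outside


def scan(cases: List[Point], reference: List[Point], radii: Sequence[float]):
    """Return (llr, center, radius) for the most unusual circular window."""
    total = len(cases)
    scale = total / len(reference)  # expected counts sum to the case total
    best: Tuple[float, Optional[Point], Optional[float]] = (0.0, None, None)
    for center in cases:
        for r in radii:
            c = sum(1 for p in cases if math.dist(p, center) <= r)
            e = scale * sum(1 for p in reference if math.dist(p, center) <= r)
            llr = poisson_llr(c, e, total)
            if llr > best[0]:
                best = (llr, center, r)
    return best


rng = random.Random(0)
# Hypothetical 10 km x 10 km study area with uniformly scattered visits.
reference = [(rng.uniform(0, 10), rng.uniform(0, 10)) for _ in range(500)]
baseline = [(rng.uniform(0, 10), rng.uniform(0, 10)) for _ in range(100)]
cases, extra = inject_cluster(baseline, (5.0, 5.0), 20, 0.2, rng)
llr, center, radius = scan(cases, reference, [0.5, 1.0])
```

In practice the significance of the best window is usually assessed by Monte Carlo replication, which this sketch omits.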

Results: More clusters were identified, and clusters were more accurately detected, when exact locations were used. That is, the detected clusters contained at least half of the simulated points and included few additional emergency department visits. These results were especially apparent when the synthetic clustered points crossed administrative boundaries and fell into multiple zip codes or census tracts.
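The accuracy criterion stated above can be expressed as a simple check on visit identifiers. In this sketch the function name `is_accurate` and the threshold `max_extra` are hypothetical; the abstract says only "few additional" visits and gives no numeric cutoff.

```python
from typing import Iterable


def is_accurate(detected: Iterable[int], simulated: Iterable[int],
                max_extra: int) -> bool:
    """True if the detected cluster holds at least half of the simulated
    visits and at most max_extra visits that were not simulated."""
    detected_set, simulated_set = set(detected), set(simulated)
    captured = len(detected_set & simulated_set)
    additional = len(detected_set - simulated_set)
    return captured >= len(simulated_set) / 2 and additional <= max_extra


# Hypothetical visit IDs: visits 1-4 were injected by the simulation.
good = is_accurate(detected=[1, 2, 3, 9], simulated=[1, 2, 3, 4], max_extra=2)
poor = is_accurate(detected=[1, 9, 10, 11], simulated=[1, 2, 3, 4], max_extra=2)
```

The first detected cluster captures three of the four simulated visits with one additional visit, so it passes; the second captures only one simulated visit and includes three additional visits, so it fails.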

Conclusions: The spatial cluster detection algorithm performed better when addresses were analyzed as exact locations than when they were analyzed as center points of zip codes or census tracts, particularly when the clustered points crossed administrative boundaries. Use of precise addresses offers improved performance, but this practice must be weighed against privacy concerns in the establishment of public health data exchange policies.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Censuses
  • Cluster Analysis*
  • Computer Simulation
  • Confidentiality*
  • Disease Outbreaks / statistics & numerical data*
  • Emergency Service, Hospital / statistics & numerical data*
  • Geographic Information Systems*
  • Geography / classification
  • Humans
  • Postal Service / classification
  • Public Health Informatics / methods*
  • Public Health Informatics / standards
  • Sentinel Surveillance