Journal article
Using spatiotemporal models to generate synthetic data for public use
Spatial and spatio-temporal epidemiology, v 27
01 Nov 2018
PMID: 30409375
Featured in Collection: UN Sustainable Development Goals @ Drexel
Abstract
When agencies release public-use data, they must be cognizant of the potential risk of disclosure associated with making their data publicly available. This issue is particularly pertinent in disease mapping, where small counts pose both inferential challenges and potential disclosure risks. While the small area estimation, disease mapping, and statistical disclosure limitation literatures are individually robust, there have been few intersections between them. Here, we formally propose the use of spatiotemporal data analysis methods to generate synthetic data for public use. Specifically, we analyze ten years of county-level heart disease death counts for multiple age-groups using a Bayesian model that accounts for dependence spatially, temporally, and between age-groups; generating synthetic data from the resulting posterior predictive distribution will preserve these dependencies. After demonstrating the synthetic data's privacy-preserving features, we illustrate their utility by comparing estimates of urban/rural disparities from the synthetic data to those from data with small counts suppressed. (C) 2018 Elsevier Ltd. All rights reserved.
Details
- Title
- Using spatiotemporal models to generate synthetic data for public use
- Creators
- Harrison Quick - Drexel University; Lance A. Waller - Emory University
- Publication Details
- Spatial and spatio-temporal epidemiology, v 27
- Publisher
- Elsevier
- Number of pages
- 9
- Resource Type
- Journal article
- Language
- English
- Academic Unit
- Epidemiology and Biostatistics
- Web of Science ID
- WOS:000449303700005
- Scopus ID
- 2-s2.0-85053423466
- Other Identifier
- 991019168622904721
InCites Highlights
Data related to this publication, from the InCites Benchmarking & Analytics tool:
- Collaboration types
- Domestic collaboration
- Web of Science research areas
- Public, Environmental & Occupational Health