Logo image
Generating partially synthetic geocoded public use data with decreased disclosure risk by using differential smoothing
Journal article   Open access

Generating partially synthetic geocoded public use data with decreased disclosure risk by using differential smoothing

Harrison Quick, Scott H. Holan and Christopher K. Wikle
Journal of the Royal Statistical Society. Series A, Statistics in society, v 181(3), pp 649-661
01 Jun 2018
url
https://arxiv.org/abs/1507.05529View

Abstract

Mathematical Methods In Social Sciences Mathematics Physical Sciences Science & Technology Social Sciences Social Sciences, Mathematical Methods Statistics & Probability
When collecting geocoded confidential data with the intent to disseminate, agencies often resort to altering the geographies before making data publicly available. An alternative to releasing aggregated and/or perturbed data is to release synthetic data, where sensitive values are replaced with draws from models designed to capture distributional features in the data collected. The issues associated with spatially outlying observations in the data, however, have received relatively little attention. Our goal here is to shed light on this problem, to propose a solution-referred to as 'differential smoothing'-and to illustrate our approach by using sale prices of homes in San Francisco.

Metrics

7 Record Views
11 citations in Scopus

Details

UN Sustainable Development Goals (SDGs)

This publication has contributed to the advancement of the following goals:

#3 Good Health and Well-Being

InCites Highlights

Data related to this publication, from InCites Benchmarking & Analytics tool:

Collaboration types
Domestic collaboration
Web of Science research areas
Social Sciences, Mathematical Methods
Statistics & Probability
Logo image