Why Data Anonymization Has Not Taken Off

Matthew J Schneider; James Bailie; Dawn Iacobucci

doi:10.48550/arxiv.2509.10165

Back

Why Data Anonymization Has Not Taken Off

Preprint

Open access

Why Data Anonymization Has Not Taken Off

Matthew J Schneider, James Bailie and Dawn Iacobucci

ArXiv.org

12 Sep 2025

DOI: https://doi.org/10.48550/arxiv.2509.10165

Files and links (1)

url

https://arxiv.org/pdf/2509.10165View

Open

Abstract

Computer Science - Computers and Society

Computer Science - Cryptography and Security

Companies are looking to data anonymization research $\unicode{x2013}$ including differential private and synthetic data methods $\unicode{x2013}$ for simple and straightforward compliance solutions. But data anonymization has not taken off in practice because it is anything but simple to implement. For one, it requires making complex choices which are case dependent, such as the domain of the dataset to anonymize; the units to protect; the scope where the data protection should extend to; and the standard of protection. Each variation of these choices changes the very meaning, as well as the practical implications, of differential privacy (or of any other measure of data anonymization). Yet differential privacy is frequently being branded as the same privacy guarantee regardless of variations in these choices. Some data anonymization methods can be effective, but only when the insights required are much larger than the unit of protection. Given that businesses care about profitability, any solution must preserve the patterns between a firm's data and that profitability. As a result, data anonymization solutions usually need to be bespoke and case-specific, which reduces their scalability. Companies should not expect easy wins, but rather recognize that anonymization is just one approach to data privacy with its own particular advantages and drawbacks, while the best strategies jointly leverage the full range of approaches to data privacy and security in combination.

Metrics

1 Record Views

Details

Title: Why Data Anonymization Has Not Taken Off
Creators: Matthew J Schneider
James Bailie
Dawn Iacobucci
Publication Details: ArXiv.org
Resource Type: Preprint
Language: English
Academic Unit: Decision Sciences (and Management Information Systems)
Other Identifier: 991022097406004721

Why Data Anonymization Has Not Taken Off

Files and links (1)

Abstract

Metrics

Details

Drexel University Social media