查看文章

researchgate.net 中的 [PDF]

A Baseline for Attribute Disclosure Risk in Synthetic Data

作者

Markus Hittmeir, Rudolf Mayer, Andreas Ekelhart

发表日期

2020/3/16

图书

Proceedings of the Tenth ACM Conference on Data and Application Security and Privacy

页码范围

133-143

简介

The generation of synthetic data is widely considered as viable method for alleviating privacy concerns and for reducing identification and attribute disclosure risk in micro-data. The records in a synthetic dataset are artificially created and thus do not directly relate to individuals in the original data in terms of a 1-to-1 correspondence. As a result, inferences about said individuals appear to be infeasible and, simultaneously, the utility of the data may be kept at a high level. In this paper, we challenge this belief by interpreting the standard attacker model for attribute disclosure as classification problem. We show how disclosure risk measures presented in recent publications may be compared to or even be reformulated as machine learning classification models. Our overall goal is to empirically analyze attribute disclosure risk in synthetic data and to discuss its close relationship to data utility. Moreover, we improve the …

引用总数

被引用次数：31

202020212022202320244 4 9 7 6

学术搜索中的文章

A baseline for attribute disclosure risk in synthetic data

M Hittmeir, R Mayer, A Ekelhart - Proceedings of the Tenth ACM Conference on Data …, 2020

被引用次数：31 相关文章所有 4 个版本