Authors
Markus Hittmeir, Rudolf Mayer, Andreas Ekelhart
Publication date
2020/3/16
Book
Proceedings of the Tenth ACM Conference on Data and Application Security and Privacy
Pages
133-143
Description
The generation of synthetic data is widely considered a viable method for alleviating privacy concerns and for reducing identification and attribute disclosure risk in micro-data. The records in a synthetic dataset are artificially created and thus do not directly relate to individuals in the original data in terms of a 1-to-1 correspondence. As a result, inferences about said individuals appear to be infeasible and, simultaneously, the utility of the data may be kept at a high level. In this paper, we challenge this belief by interpreting the standard attacker model for attribute disclosure as a classification problem. We show how disclosure risk measures presented in recent publications may be compared to or even be reformulated as machine learning classification models. Our overall goal is to empirically analyze attribute disclosure risk in synthetic data and to discuss its close relationship to data utility. Moreover, we improve the …
Total citations
Citations per year (bar chart): 2020: 4, 2021: 4, 2022: 9, 2023: 7, 2024: 6
Scholar articles
M Hittmeir, R Mayer, A Ekelhart - Proceedings of the Tenth ACM Conference on Data …, 2020