查看文章

tuwien.ac.at 中的 [PDF]

Utility and Privacy Assessments of Synthetic Data for Regression Tasks

作者

Markus Hittmeir, Andreas Ekelhart, Rudolf Mayer

发表日期

2019/12/9

研讨会论文

2019 IEEE International Conference on Big Data (Big Data)

页码范围

5763-5772

出版商

IEEE

简介

With ever increasing capacity for collecting, storing, and processing of data, there is also a high demand for intelligent data analysis methods. While there have been impressive advances in machine learning and similar domains in recent years, this also gives rise to concerns regarding the protection of personal and otherwise sensitive data, especially if it is to be analysed by third parties. Besides anonymisation, which becomes challenging with high dimensional data, one approach for privacy-preserving data mining lies in the usage of synthetic data, which comes with the promise of protecting the users’ data and producing analysis results close to those achieved by using real data. In this paper, we analyse a number of different approaches for creating synthetic data, and study the utility of the created datasets for regression tasks, i.e. the prediction of a numeric value. We further investigate the similarity of real and …

引用总数

被引用次数：45

202020212022202320241 6 15 17 6

学术搜索中的文章

Utility and privacy assessments of synthetic data for regression tasks

M Hittmeir, A Ekelhart, R Mayer - 2019 IEEE International Conference on Big Data (Big …, 2019

被引用次数：45 相关文章所有 5 个版本