Anonymization techniques for privacy preserving data publishing: A comprehensive survey

A Majeed, S Lee - IEEE access, 2020 - ieeexplore.ieee.org
Anonymization is a practical solution for preserving user's privacy in data publishing. Data
owners such as hospitals, banks, social network (SN) service providers, and insurance …

A software engineering perspective on engineering machine learning systems: State of the art and challenges

G Giray - Journal of Systems and Software, 2021 - Elsevier
Context: Advancements in machine learning (ML) lead to a shift from the traditional view of
software development, where algorithms are hard-coded by humans, to ML systems …

[HTML][HTML] A multifaceted benchmarking of synthetic electronic health record generation models

C Yan, Y Yan, Z Wan, Z Zhang, L Omberg… - Nature …, 2022 - nature.com
Synthetic health data have the potential to mitigate privacy concerns in supporting
biomedical research and healthcare applications. Modern approaches for data generation …

Using gans for sharing networked time series data: Challenges, initial promise, and open questions

Z Lin, A Jain, C Wang, G Fanti, V Sekar - Proceedings of the ACM …, 2020 - dl.acm.org
Limited data access is a longstanding barrier to data-driven research and development in
the networked systems community. In this work, we explore if and how generative …

[HTML][HTML] Measuring large-scale social networks with high resolution

A Stopczynski, V Sekara, P Sapiezynski, A Cuttone… - PloS one, 2014 - journals.plos.org
This paper describes the deployment of a large-scale study designed to measure human
interactions across a variety of communication channels, with high temporal resolution and …

Gradient-leakage resilient federated learning

W Wei, L Liu, Y Wu, G Su… - 2021 IEEE 41st …, 2021 - ieeexplore.ieee.org
Federated learning (FL) is an emerging distributed learning paradigm with default client
privacy because clients can keep sensitive data on their devices and only share local …

Anonymization of location data does not work: A large-scale measurement study

H Zang, J Bolot - Proceedings of the 17th annual international …, 2011 - dl.acm.org
We examine a very large-scale data set of more than 30 billion call records made by 25
million cell phone users across all 50 states of the US and attempt to determine to what …

Utility-privacy tradeoffs in databases: An information-theoretic approach

L Sankar, SR Rajagopalan… - IEEE Transactions on …, 2013 - ieeexplore.ieee.org
Ensuring the usefulness of electronic data sources while providing necessary privacy
guarantees is an important unsolved problem. This problem drives the need for an analytical …

Slicing: A new approach for privacy preserving data publishing

T Li, N Li, J Zhang, I Molloy - IEEE transactions on knowledge …, 2010 - ieeexplore.ieee.org
Several anonymization techniques, such as generalization and bucketization, have been
designed for privacy preserving microdata publishing. Recent work has shown that …

Privacy-and utility-preserving textual analysis via calibrated multivariate perturbations

O Feyisetan, B Balle, T Drake, T Diethe - … on web search and data mining, 2020 - dl.acm.org
Accurately learning from user data while providing quantifiable privacy guarantees provides
an opportunity to build better ML models while maintaining user trust. This paper presents a …