Data deduplication techniques for efficient cloud storage management: a systematic review

R Kaur, I Chana, J Bhattacharya - The Journal of Supercomputing, 2018 - Springer
The exponential growth of digital data in cloud storage systems is a critical issue presently
as a large amount of duplicate data in the storage systems exerts an extra load on it …

A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

FastCDC: A fast and efficient Content-Defined chunking approach for data deduplication

W Xia, Y Zhou, H Jiang, D Feng, Y Hua, Y Hu… - 2016 USENIX Annual …, 2016 - usenix.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems in the past 15 years or so due to its high redundancy detection ability. However …

The design of fast content-defined chunking for data deduplication based storage systems

W Xia, X Zou, H Jiang, Y Zhou, C Liu… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems recently due to its high redundancy detection ability. However, existing CDC-based …

SecDep: A user-aware efficient fine-grained secure deduplication scheme with multi-level key management

Y Zhou, D Feng, W Xia, M Fu, F Huang… - … 31st symposium on …, 2015 - ieeexplore.ieee.org
Nowadays, many customers and enterprises backup their data to cloud storage that
performs deduplication to save storage space and network bandwidth. Hence, how to …

Leveraging data deduplication to improve the performance of primary storage systems in the cloud

B Mao, H Jiang, S Wu, L Tian - … of the 4th annual Symposium on Cloud …, 2013 - dl.acm.org
Recent studies have shown that moderate to high data redundancy exists in primary storage
systems, such as VM-based, enterprise and HPC storage systems, which indicates that the …

CIDR: A cost-effective in-line data reduction system for terabit-per-second scale SSD arrays

M Ajdari, P Park, J Kim, D Kwon… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
An SSD array, a storage system consisting of multiple SSDs per node, has become a design
choice to implement a fast primary storage system, and modern storage architects now aim …

Analytical enumeration of redundant data anomalies in energy consumption readings of smart buildings with a case study of Darmstadt smart city in Germany

PP Kasaraneni, VPK Yellapragada, GLK Moganti… - Sustainability, 2022 - mdpi.com
High-quality data are always desirable for superior decision-making in smart buildings.
However, latency issues, communication failures, meter glitches, etc., create data …

Towards dependable and trustworthy outsourced computing: A comprehensive survey and tutorial

M Zhao, C Hu, X Song, C Zhao - Journal of Network and Computer …, 2019 - Elsevier
Cloud computing provides the clients with diversified services in a flexible manner. Recently,
the cloud platforms have been the basic underling-support for The IoT and mobile …

Muse: A multi-tiered and SLA-driven deduplication framework for cloud storage systems

J Yin, Y Tang, S Deng, B Zheng… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
For cloud storage service vendors, balancing the client-perceived IO performance and the
self-perceived space cost is always one of the standing challenges. When applying …