Data deduplication techniques for efficient cloud storage management: a systematic review

R Kaur, I Chana, J Bhattacharya - The Journal of Supercomputing, 2018 - Springer
The exponential growth of digital data in cloud storage systems is a critical issue presently
as a large amount of duplicate data in the storage systems exerts an extra load on it …

A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

{FastCDC}: A fast and efficient {Content-Defined} chunking approach for data deduplication

W Xia, Y Zhou, H Jiang, D Feng, Y Hua, Y Hu… - 2016 USENIX Annual …, 2016 - usenix.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems in the past 15 years or so due to its high redundancy detection abil-ity. However …

AE: An asymmetric extremum content defined chunking algorithm for fast and bandwidth-efficient data deduplication

Y Zhang, H Jiang, D Feng, W Xia, M Fu… - … IEEE Conference on …, 2015 - ieeexplore.ieee.org
Data deduplication, a space-efficient and bandwidth-saving technology, plays an important
role in bandwidth-efficient data transmission in various data-intensive network and cloud …

The design of fast content-defined chunking for data deduplication based storage systems

W Xia, X Zou, H Jiang, Y Zhou, C Liu… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems recently due to its high redundancy detection ability. However, existing CDC-based …

The design of fast and lightweight resemblance detection for efficient post-deduplication delta compression

W Xia, L Pu, X Zou, P Shilane, S Li, H Zhang… - ACM Transactions on …, 2023 - dl.acm.org
Post-deduplication delta compression is a data reduction technique that calculates and
stores the differences of very similar but non-duplicate chunks in storage systems, which is …

Finesse:{Fine-Grained} Feature Locality based Fast Resemblance Detection for {Post-Deduplication} Delta Compression

Y Zhang, W Xia, D Feng, H Jiang, Y Hua… - 17th USENIX Conference …, 2019 - usenix.org
In storage systems, delta compression is often used as a complementary data reduction
technique for data deduplication because it is able to eliminate redundancy among the non …

Building a high-performance fine-grained deduplication framework for backup storage with high deduplication ratio

X Zou, W Xia, P Shilane, H Zhang, X Wang - 2022 USENIX Annual …, 2022 - usenix.org
Fine-grained deduplication, which first removes identical chunks and then eliminates
redundancies between similar but non-identical chunks (ie, delta compression), could …

LOFS: A lightweight online file storage strategy for effective data deduplication at network edge

G Cheng, D Guo, L Luo, J Xia… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Edge computing responds to users' requests with low latency by storing the relevant files at
the network edge. Various data deduplication technologies are currently employed at edge …

Accelerating content-defined-chunking based data deduplication by exploiting parallelism

W Xia, D Feng, H Jiang, Y Zhang, V Chang… - Future Generation …, 2019 - Elsevier
Data deduplication, a data reduction technique that efficiently detects and eliminates
redundant data chunks and files, has been widely applied in large-scale storage systems …