Data deduplication techniques for efficient cloud storage management: a systematic review

R Kaur, I Chana, J Bhattacharya - The Journal of Supercomputing, 2018 - Springer
The exponential growth of digital data in cloud storage systems is a critical issue presently
as a large amount of duplicate data in the storage systems exerts an extra load on it …

A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

A power controlled multiple access protocol for wireless packet networks

JP Monks, V Bharghavan… - … IEEE INFOCOM 2001 …, 2001 - ieeexplore.ieee.org
Multiple access-based collision avoidance MAC protocols have typically used fixed
transmission power, and have not considered power control mechanisms based on the …

Design tradeoffs for data deduplication performance in backup workloads

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - … USENIX Conference on …, 2015 - usenix.org
Data deduplication has become a standard component in modern backup systems. In order
to understand the fundamental tradeoffs in each of its design choices (such as prefetching …

FastCDC: A fast and efficient Content-Defined chunking approach for data deduplication

W Xia, Y Zhou, H Jiang, D Feng, Y Hua, Y Hu… - 2016 USENIX Annual …, 2016 - usenix.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems in the past 15 years or so due to its high redundancy detection ability. However …

WAN-optimized replication of backup datasets using stream-informed delta compression

P Shilane, M Huang, G Wallace, W Hsu - ACM Transactions on Storage …, 2012 - dl.acm.org
Replicating data off site is critical for disaster recovery reasons, but the current approach of
transferring tapes is cumbersome and error prone. Replicating across a wide area network …

Method and apparatus for determining optimal chunk sizes of a deduplicated storage system

F Douglis, PN Shilane, G Wallace - US Patent 8,639,669, 2014 - Google Patents
BACKGROUND In a deduplicating storage system, content is typically divided into
variable-sized "chunks" based on characteristics of the data. If a hash of a chunk, also known as a …

Optimizing data block size for deduplication

T Ram - US Patent 9,626,373, 2017 - Google Patents
Provided herein is technology relating to data deduplication and particularly, but not
exclusively, to methods and systems for determining an efficiently optimal size of data blocks …

The design of fast content-defined chunking for data deduplication based storage systems

W Xia, X Zou, H Jiang, Y Zhou, C Liu… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems recently due to its high redundancy detection ability. However, existing CDC-based …

The dilemma between deduplication and locality: Can both be achieved?

X Zou, J Yuan, P Shilane, W Xia, H Zhang… - … USENIX conference on …, 2021 - usenix.org
Data deduplication is widely used to reduce the size of backup workloads, but it has the
known disadvantage of causing poor data locality, also referred to as the fragmentation …