[PDF][PDF] Building a high-performance deduplication system

F Guo, P Efstathopoulos - 2011 USENIX Annual Technical Conference …, 2011 - usenix.org
Modern deduplication has become quite effective at eliminating duplicates in data, thus
multiplying the effective capacity of disk-based backup systems, and enabling them as …

Dynamic data deduplication in cloud storage

W Leesakul, P Townend, J Xu - 2014 IEEE 8th International …, 2014 - ieeexplore.ieee.org
Cloud computing plays a major role in the business domain today as computing resources
are delivered as a utility on demand to customers over the Internet. Cloud storage is one of …

A survey and comparative study of data deduplication techniques

J Malhotra, J Bakal - 2015 International Conference on …, 2015 - ieeexplore.ieee.org
Increase in enormous amount of digital data needs more storage space, which in turn
significantly increases the cost of backup and its performance. Traditional backup solutions …

QuickDedup: Efficient VM deduplication in cloud computing environments

S Saharan, G Somani, G Gupta, R Verma… - Journal of Parallel and …, 2020 - Elsevier
Deduplication is one of the major storage optimisation techniques for Virtual Machines
(VMs) in cloud environment. Usually, hashing of blocks helps in identifying duplicate data …

The dilemma between deduplication and locality: Can both be achieved?

X Zou, J Yuan, P Shilane, W Xia, H Zhang… - … USENIX conference on …, 2021 - usenix.org
Data deduplication is widely used to reduce the size of backup workloads, but it has the
known disadvantage of causing poor data locality, also referred to as the fragmentation …

[PDF][PDF] {ViDeDup}: An {Application-Aware} Framework for Video De-duplication

A Katiyar, J Weissman - 3rd Workshop on Hot Topics in Storage and File …, 2011 - usenix.org
Key to the compression-capability of a data deduplication system is the definition of
redundancy. Traditionally, two data items are considered redundant if their underlying bit …

Lipa: A learning-based indexing and prefetching approach for data deduplication

G Xu, B Tang, H Lu, Q Yu… - 2019 35th Symposium on …, 2019 - ieeexplore.ieee.org
In this paper, we present a learning based data deduplication algorithm, called LIPA, which
uses the reinforcement learning framework to build an adaptive indexing structure. It is …

Design tradeoffs for data deduplication performance in backup workloads

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - … USENIX Conference on …, 2015 - usenix.org
Data deduplication has become a standard component in modern backup systems. In order
to understand the fundamental tradeoffs in each of its design choices (such as prefetching …

On information leakage in deduplicated storage systems

H Ritzdorf, G Karame, C Soriente… - … of the 2016 ACM on Cloud …, 2016 - dl.acm.org
Most existing cloud storage providers rely on data deduplication in order to significantly save
storage costs by storing duplicate data only once. While the literature has thoroughly …

Resemblance and mergence based indexing for high performance data deduplication

P Zhang, P Huang, X He, H Wang, K Zhou - Journal of Systems and …, 2017 - Elsevier
Data deduplication, a data redundancy elimination technique, has been widely employed in
many application environments to reduce data storage space. However, it is challenging to …