A survey and classification of storage deduplication systems

J Paulo, J Pereira - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
The automatic elimination of duplicate data in a storage system, commonly known as
deduplication, is increasingly accepted as an effective technique to reduce storage costs …

A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

[PDF][PDF] iDedup: latency-aware, inline data deduplication for primary storage.

K Srinivasan, T Bisson, GR Goodson, K Voruganti - Fast, 2012 - usenix.org
Deduplication technologies are increasingly being deployed to reduce cost and increase
space-efficiency in corporate data centers. However, prior research has not applied …

A Full {GPU} Virtualization Solution with Mediated {Pass-Through}

K Tian, Y Dong, D Cowperthwaite - 2014 USENIX Annual Technical …, 2014 - usenix.org
A Full GPU Virtualization Solution with Mediated Pass-Through Page 1 This paper is
included in the Proceedings of USENIX ATC ’14: 2014 USENIX Annual Technical …

Nitro: A {Capacity-Optimized}{SSD} Cache for Primary Storage

C Li, P Shilane, F Douglis, H Shim… - 2014 USENIX Annual …, 2014 - usenix.org
For many primary storage customers, storage must balance the requirements for large
capacity, high performance, and low cost. A well studied technique is to place a solid state …

Accelerating restore and garbage collection in deduplication-based backup systems via exploiting historical information

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - 2014 USENIX Annual …, 2014 - usenix.org
In deduplication-based backup systems, the chunks of each backup are physically scattered
after deduplication, which causes a challenging fragmentation problem. The fragmentation …

Fast and low-RAM-footprint indexing for data deduplication

S Sengupta, B Debnath, J Li, RN Desai… - US Patent …, 2015 - Google Patents
The subject disclosure is directed towards a data deduplica tion technology in which a hash
index services index main tains a hash index in a secondary storage device Such as a hard …

Dblk: Deduplication for primary block storage

Y Tsuchiya, T Watanabe - 2011 IEEE 27th Symposium on …, 2011 - ieeexplore.ieee.org
The deduplication block-device (DBLK) is a deduplication and compression system with a
block device interface. It is used as a primary storage and block-wise deduplication is done …

Reducing impact of data fragmentation caused by in-line deduplication

M Kaczmarczyk, M Barczynski, W Kilian… - Proceedings of the 5th …, 2012 - dl.acm.org
Deduplication results inevitably in data fragmentation, because logically continuous data is
scattered across many disk locations. In this work we focus on fragmentation caused by …

Live deduplication storage of virtual machine images in an open-source cloud

CH Ng, M Ma, TY Wong, PPC Lee, JCS Lui - Middleware 2011: ACM/IFIP …, 2011 - Springer
Deduplication is an approach of avoiding storing data blocks with identical content, and has
been shown to effectively reduce the disk space for storing multi-gigabyte virtual machine …