A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

Demystifying data deduplication

N Mandagere, P Zhou, MA Smith… - Proceedings of the ACM …, 2008 - dl.acm.org
Effectiveness and tradeoffs of deduplication technologies are not well understood--vendors
tout Deduplication as a" silver bullet" that can help any enterprise optimize its deployed …

Avoiding the disk bottleneck in the data domain deduplication file system.

B Zhu, K Li, RH Patterson - Fast, 2008 - usenix.org
Disk-based deduplication storage has emerged as the new-generation storage system for
enterprise data protection to replace tape libraries. Deduplication removes redundant data …

MAD2: A scalable high-throughput exact deduplication approach for network backup services

J Wei, H Jiang, K Zhou, D Feng - 2010 IEEE 26th Symposium …, 2010 - ieeexplore.ieee.org
Deduplication has been widely used in disk-based secondary storage systems to improve
space efficiency. However, there are two challenges facing scalable high-throughput …

Hpdedup: A hybrid prioritized data deduplication mechanism for primary storage in the cloud

H Wu, C Wang, Y Fu, S Sakr, L Zhu, K Lu - arXiv preprint arXiv:1702.08153, 2017 - arxiv.org
Eliminating duplicate data in primary storage of clouds increases the cost-efficiency of cloud
service providers as well as reduces the cost of users for using cloud services. Existing …

A survey and classification of storage deduplication systems

J Paulo, J Pereira - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
The automatic elimination of duplicate data in a storage system, commonly known as
deduplication, is increasingly accepted as an effective technique to reduce storage costs …

Generating realistic datasets for deduplication analysis

V Tarasov, A Mudrankit, W Buik, P Shilane… - 2012 USENIX Annual …, 2012 - usenix.org
Deduplication is a popular component of modern storage systems, with a wide variety of
approaches. Unlike traditional storage systems, deduplication performance depends on …

Data duplication that mitigates storage requirements

YX Li, YM Li, MG Sisco, X Xu - US Patent 9,645,754, 2017 - Google Patents
BACKGROUND Data backup is critical to storage systems. In backup systems, however,
data deduplication technology is a newly introduced technical Solution for reducing storage …

[PDF][PDF] {SiLo}: A {Similarity-Locality} based {Near-Exact} deduplication scheme with low {RAM} overhead and high throughput

W Xia, H Jiang, D Feng, Y Hua - 2011 USENIX Annual Technical …, 2011 - usenix.org
Data Deduplication is becoming increasingly popular in storage systems as a space-efficient
approach to data backup and archiving. Most existing state-of-the-art deduplication methods …

[PDF][PDF] Building a high-performance deduplication system

F Guo, P Efstathopoulos - 2011 USENIX Annual Technical Conference …, 2011 - usenix.org
Modern deduplication has become quite effective at eliminating duplicates in data, thus
multiplying the effective capacity of disk-based backup systems, and enabling them as …