P-dedupe: Exploiting parallelism in data deduplication system

W Xia, H Jiang, D Feng, L Tian, M Fu… - 2012 IEEE Seventh …, 2012 - ieeexplore.ieee.org
Data deduplication, an efficient space reduction method, has gained increasing attention
and popularity in data-intensive storage systems. Most existing state-of-the-art deduplication …

A novel optimization method to improve de-duplication storage system performance

C Liu, Y Xue, D Ju, D Wang - 2009 15th International …, 2009 - ieeexplore.ieee.org
Data De-duplication has become a commodity component in data-intensive storage
systems. But compared with other traditional storage paradigms, de-duplication system …

[PDF][PDF] Accelerating data deduplication by exploiting pipelining and parallelism with multicore or manycore processors

W Xia, H Jiang, D Feng, L Tian - Proc. 10th USENIX Conf. File Storage …, 2012 - usenix.org
As the amount of the digital data grows explosively, Data deduplication has gained
increasing attention for its space-efficient functionality that not only reduces the storage …

DEBAR: A scalable high-performance de-duplication storage system for backup and archiving

T Yang, H Jiang, D Feng, Z Niu… - … on Parallel & …, 2010 - ieeexplore.ieee.org
Driven by the increasing demand for large-scale and high-performance data protection, disk-
based de-duplication storage has become a new research focus of the storage industry and …

[PDF][PDF] {SiLo}: A {Similarity-Locality} based {Near-Exact} deduplication scheme with low {RAM} overhead and high throughput

W Xia, H Jiang, D Feng, Y Hua - 2011 USENIX Annual Technical …, 2011 - usenix.org
Data Deduplication is becoming increasingly popular in storage systems as a space-efficient
approach to data backup and archiving. Most existing state-of-the-art deduplication methods …

MAD2: A scalable high-throughput exact deduplication approach for network backup services

J Wei, H Jiang, K Zhou, D Feng - 2010 IEEE 26th Symposium …, 2010 - ieeexplore.ieee.org
Deduplication has been widely used in disk-based secondary storage systems to improve
space efficiency. However, there are two challenges facing scalable high-throughput …

dedupv1: Improving deduplication throughput using solid state drives (SSD)

D Meister, A Brinkmann - 2010 IEEE 26th Symposium on Mass …, 2010 - ieeexplore.ieee.org
Data deduplication systems discover and remove redundancies between data blocks. The
search for redundant data blocks is often based on hashing the content of a block and …

A scalable parallel deduplication algorithm

W Santos, T Teixeira, C Machado… - … (SBAC-PAD'07), 2007 - ieeexplore.ieee.org
The identification of replicas in a database is fundamental to improve the quality of the
information. Deduplication is the task of identifying replicas in a database that refer to the …

ADMAD: Application-driven metadata aware de-duplication archival storage system

C Liu, Y Lu, C Shi, G Lu, DHC Du… - 2008 Fifth IEEE …, 2008 - ieeexplore.ieee.org
There is a huge amount of duplicated or redundant data in current storage systems. So data
de-duplication, which uses lossless data compression schemes to minimize the duplicated …

POD: Performance oriented I/O deduplication for primary storage systems in the cloud

B Mao, H Jiang, S Wu, L Tian - 2014 IEEE 28th International …, 2014 - ieeexplore.ieee.org
Recent studies have shown that moderate to high data redundancy clearly exists in primary
storage systems in the Cloud. Our experimental studies reveal that data redundancy exhibits …