ADMAD: Application-driven metadata aware de-duplication archival storage system

C Liu, Y Lu, C Shi, G Lu, DHC Du… - 2008 Fifth IEEE …, 2008 - ieeexplore.ieee.org
There is a huge amount of duplicated or redundant data in current storage systems. So data
de-duplication, which uses lossless data compression schemes to minimize the duplicated …

A novel optimization method to improve de-duplication storage system performance

C Liu, Y Xue, D Ju, D Wang - 2009 15th International …, 2009 - ieeexplore.ieee.org
Data De-duplication has become a commodity component in data-intensive storage
systems. But compared with other traditional storage paradigms, de-duplication system …

Semantic data de-duplication for archival storage systems

C Liu, D Ju, Y Gu, Y Zhang, D Wang… - 2008 13th Asia-Pacific …, 2008 - ieeexplore.ieee.org
In archival storage systems, there is a huge amount of duplicate data or redundant data,
which occupy significant extra equipments and power consumptions, largely lowering down …

DEBAR: A scalable high-performance de-duplication storage system for backup and archiving

T Yang, H Jiang, D Feng, Z Niu… - … on Parallel & …, 2010 - ieeexplore.ieee.org
Driven by the increasing demand for large-scale and high-performance data protection, disk-
based de-duplication storage has become a new research focus of the storage industry and …

P-dedupe: Exploiting parallelism in data deduplication system

W Xia, H Jiang, D Feng, L Tian, M Fu… - 2012 IEEE Seventh …, 2012 - ieeexplore.ieee.org
Data deduplication, an efficient space reduction method, has gained increasing attention
and popularity in data-intensive storage systems. Most existing state-of-the-art deduplication …

MAD2: A scalable high-throughput exact deduplication approach for network backup services

J Wei, H Jiang, K Zhou, D Feng - 2010 IEEE 26th Symposium …, 2010 - ieeexplore.ieee.org
Deduplication has been widely used in disk-based secondary storage systems to improve
space efficiency. However, there are two challenges facing scalable high-throughput …

Characterizing datasets for data deduplication in backup applications

N Park, DJ Lilja - IEEE International Symposium on Workload …, 2010 - ieeexplore.ieee.org
The compression and throughput performance of data deduplication system is directly
affected by the input dataset. We propose two sets of evaluation metrics, and the means to …

SAM: A semantic-aware multi-tiered source de-duplication framework for cloud backup

Y Tan, H Jiang, D Feng, L Tian, Z Yan… - … on Parallel Processing, 2010 - ieeexplore.ieee.org
Existing de-duplication solutions in cloud backup environment either obtain high
compression ratios at the cost of heavy de-duplication overheads in terms of increased …

POD: Performance oriented I/O deduplication for primary storage systems in the cloud

B Mao, H Jiang, S Wu, L Tian - 2014 IEEE 28th International …, 2014 - ieeexplore.ieee.org
Recent studies have shown that moderate to high data redundancy clearly exists in primary
storage systems in the Cloud. Our experimental studies reveal that data redundancy exhibits …

Assuring demanded read performance of data deduplication storage with backup datasets

YJ Nam, D Park, DHC Du - 2012 IEEE 20th International …, 2012 - ieeexplore.ieee.org
Data deduplication has been widely adopted in contemporary backup storage systems. It not
only saves storage space considerably, but also shortens the data backup time significantly …