A comprehensive study of the past, present, and future of data deduplication

W Xia, H Jiang, D Feng, F Douglis… - Proceedings of the …, 2016 - ieeexplore.ieee.org
Data deduplication, an efficient approach to data reduction, has gained increasing attention
and popularity in large-scale storage systems due to the explosive growth of digital data. It …

Characteristics of backup workloads in production systems

G Wallace, F Douglis, H Qian, P Shilane, S Smaldone… - FAST, 2012 - usenix.org

Improving restore speed for backup systems that use inline chunk-based deduplication

M Lillibridge, K Eshghi, D Bhagwat - 11th USENIX Conference on File …, 2013 - usenix.org
Slow restoration due to chunk fragmentation is a serious problem facing inline chunk-based
data deduplication systems: restore speeds for the most recent backup can drop orders of …

Design tradeoffs for data deduplication performance in backup workloads

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - … USENIX Conference on …, 2015 - usenix.org
Data deduplication has become a standard component in modern backup systems. In order
to understand the fundamental tradeoffs in each of its design choices (such as prefetching …

Accelerating restore and garbage collection in deduplication-based backup systems via exploiting historical information

M Fu, D Feng, Y Hua, X He, Z Chen, W Xia… - 2014 USENIX Annual …, 2014 - usenix.org
In deduplication-based backup systems, the chunks of each backup are physically scattered
after deduplication, which causes a challenging fragmentation problem. The fragmentation …

The dilemma between deduplication and locality: Can both be achieved?

X Zou, J Yuan, P Shilane, W Xia, H Zhang… - … USENIX conference on …, 2021 - usenix.org
Data deduplication is widely used to reduce the size of backup workloads, but it has the
known disadvantage of causing poor data locality, also referred to as the fragmentation …

ALACC: Accelerating Restore Performance of Data Deduplication Systems Using Adaptive Look-Ahead Window Assisted Chunk Caching

Z Cao, H Wen, F Wu, DHC Du - 16th USENIX Conference on File and …, 2018 - usenix.org
Data deduplication has been widely applied in storage systems to improve the efficiency of
space utilization. In data deduplication systems, the data restore performance is seriously …

Assuring demanded read performance of data deduplication storage with backup datasets

YJ Nam, D Park, DHC Du - 2012 IEEE 20th International …, 2012 - ieeexplore.ieee.org
Data deduplication has been widely adopted in contemporary backup storage systems. It not
only saves storage space considerably, but also shortens the data backup time significantly …

Sliding Look-Back Window Assisted Data Chunk Rewriting for Improving Deduplication Restore Performance

Z Cao, S Liu, F Wu, G Wang, B Li, DHC Du - 17th USENIX Conference …, 2019 - usenix.org
Data deduplication is an effective way of improving storage space utilization. The data
generated by deduplication is persistently stored in data chunks or data containers (a …

LoopDelta: Embedding Locality-aware Opportunistic Delta Compression in Inline Deduplication for Highly Efficient Data Reduction

Y Zhang, H Jiang, D Feng, N Jiang, T Qiu… - 2023 USENIX Annual …, 2023 - usenix.org
As a complement to data deduplication, delta compression further reduces the data volume
by compressing non-duplicate data chunks relative to their similar chunks (base chunks) …