作者
Min Fu, Dan Feng, Yu Hua, Xubin He, Zuoning Chen, Wen Xia, Yucheng Zhang, Yujuan Tan
发表日期
2015
研讨会论文
13th USENIX Conference on File and Storage Technologies (FAST 15)
页码范围
331-344
简介
Data deduplication has become a standard component in modern backup systems. In order to understand the fundamental tradeoffs in each of its design choices (such as prefetching and sampling), we disassemble data deduplication into a large N-dimensional parameter space. Each point in the space is of various parameter settings, and performs a tradeoff among backup and restore performance, memory footprint, and storage cost. Existing and potential solutions can be considered as specific points in the space. Then, we propose a general-purpose frame-work to evaluate various deduplication solutions in the space. Given that no single solution is perfect in all metrics, our goal is to find some reasonable solutions that have sustained backup performance and perform a suitable tradeoff between deduplication ratio, memory footprints, and restore performance. Our findings from extensive experiments using real-world workloads provide a detailed guide to make efficient design decisions according to the desired tradeoff.
引用总数
20152016201720182019202020212022202320247181618141417201312
学术搜索中的文章
M Fu, D Feng, Y Hua, X He, Z Chen, W Xia, Y Zhang… - 13th USENIX Conference on File and Storage …, 2015