作者
Jiansheng Wei, Hong Jiang, Ke Zhou, Dan Feng
发表日期
2010/5/3
研讨会论文
2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
页码范围
1-14
出版商
IEEE
简介
Deduplication has been widely used in disk-based secondary storage systems to improve space efficiency. However, there are two challenges facing scalable high-throughput deduplication storage. The first is the duplicate-lookup disk bottleneck due to the large size of data index that usually exceeds the available RAM space, which limits the deduplication throughput. The second is the storage node island effect resulting from duplicate data among multiple storage nodes that are difficult to eliminate. Existing approaches fail to completely eliminate the duplicates while simultaneously addressing the challenges. This paper proposes MAD2, a scalable high-throughput exact deduplication approach for network backup services. MAD2 eliminates duplicate data both at the file level and at the chunk level by employing four techniques to accelerate the deduplication process and evenly distribute data. First, MAD2 …
引用总数
2009201020112012201320142015201620172018201920202021202220232024284211226101212866732
学术搜索中的文章
J Wei, H Jiang, K Zhou, D Feng - 2010 IEEE 26th Symposium on Mass Storage Systems …, 2010