The design of fast content-defined chunking for data deduplication based storage systems

W Xia, X Zou, H Jiang, Y Zhou, C Liu… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Content-Defined Chunking (CDC) has been playing a key role in data deduplication
systems recently due to its high redundancy detection ability. However, existing CDC-based …

DupHunter: Flexible High-Performance Deduplication for Docker Registries

N Zhao, H Albahar, S Abraham, K Chen… - 2020 USENIX Annual …, 2020 - usenix.org
Containers are increasingly used in a broad spectrum of applications from cloud services to
storage to supporting emerging edge computing paradigm. This has led to an explosive …

Improving restore performance for in-line backup system combining deduplication and delta compression

Y Zhang, Y Yuan, D Feng, C Wang… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Data deduplication, though being efficient in removing duplicate chunks, introduces chunk
fragmentation which decreases restore performance. Rewriting algorithms are proposed to …

Efficient big-data access: Taxonomy and a comprehensive survey

A Alazzawe, A Pal, K Kant - IEEE transactions on big data, 2020 - ieeexplore.ieee.org
The emerging systems are not only generating huge amounts of data but also expect this
data to be analyzed expeditiously to drive online decision-making and control. Thus …

Improving restore performance of packed datasets in deduplication systems via reducing persistent fragmented chunks

Y Zhang, M Fu, X Wu, F Wang, Q Wang… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Data deduplication, though being efficient for redundancy elimination in storage systems,
introduces chunk fragmentation which severely decreases restore performance. Rewriting …

A similarity clustering-based deduplication strategy in cloud storage systems

S Long, Z Li, Z Liu, Q Deng, S Oh… - 2020 IEEE 26th …, 2020 - ieeexplore.ieee.org
Deduplication is a data redundancy elimination technique, designed to save system storage
resources by reducing redundant data in cloud storage systems. With the development of …

iTRIM: I/O-aware TRIM for improving user experience on mobile devices

Y Liang, C Ji, C Fu, R Ausavarungnirun… - … on Computer-Aided …, 2020 - ieeexplore.ieee.org
TRIM is a recommended command to deliver data invalidation information of the file system
to flash storage. It is issued on both system level and device level. Since it can reduce the …

A cloud-native architecture for replicated data services

H Saxena, J Pound - 12th USENIX Workshop on Hot Topics in Cloud …, 2020 - usenix.org
Many services replicate data for fault-tolerant storage of the data and high-availability of the
service. When deployed in the cloud, the replication performed by these services provides …

Improving the Restore Performance via Physical-Locality Middleware for Backup Systems

P Li, Y Hua, Q Cao, M Zhang - Proceedings of the 21st International …, 2020 - dl.acm.org
Data deduplication as an important middleware plays an essential role in current backup
systems due to the high space efficiency, which however suffers from low restore …

PBCCF: Accelerated deduplication by prefetching backup content correlated fingerprints

Y Qin, X Zhang, DJ Lilja - 2020 IEEE 38th International …, 2020 - ieeexplore.ieee.org
Deduplication provides significant benefits for accelerating large-scale storage systems,
particularly backup systems, by eliminating the redundancy of the streaming data. Given the …