A survey and classification of storage deduplication systems

J Paulo, J Pereira - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
The automatic elimination of duplicate data in a storage system, commonly known as
deduplication, is increasingly accepted as an effective technique to reduce storage costs …

Erasure coding in windows azure storage

C Huang, H Simitci, Y Xu, A Ogus, B Calder… - 2012 USENIX Annual …, 2012 - usenix.org
Windows Azure Storage (WAS) is a cloud storage system that provides customers the ability
to store seemingly limitless amounts of data for any duration of time. WAS customers have …

A study of practical deduplication

DT Meyer, WJ Bolosky - ACM Transactions on Storage (ToS), 2012 - dl.acm.org
We collected file system content data from 857 desktop computers at Microsoft over a span
of 4 weeks. We analyzed the data to determine the relative efficacy of data deduplication …

To {FUSE} or not to {FUSE}: Performance of {User-Space} file systems

BKR Vangoor, V Tarasov, E Zadok - 15th USENIX Conference on File …, 2017 - usenix.org
Traditionally, file systems were implemented as part of OS kernels. However, as complexity
of file systems grew, many new file systems began being developed in user space …

[PDF][PDF] {CAFTL}: A {Content-Aware} flash translation layer enhancing the lifespan of flash memory based solid state drives

F Chen, T Luo, X Zhang - 9th USENIX Conference on File and Storage …, 2011 - usenix.org
Abstract Although Flash Memory based Solid State Drive (SSD) exhibits high performance
and low power consumption, a critical concern is its limited lifespan along with the …

Pyramid codes: Flexible schemes to trade space for access efficiency in reliable data storage systems

C Huang, M Chen, J Li - ACM Transactions on Storage (TOS), 2013 - dl.acm.org
We design flexible schemes to explore the tradeoffs between storage space and access
efficiency in reliable data storage systems. Aiming at this goal, two new classes of erasure …

{EC-Cache}:{Load-Balanced},{Low-Latency} Cluster Caching with Online Erasure Coding

KV Rashmi, M Chowdhury, J Kosaian, I Stoica… - … USENIX Symposium on …, 2016 - usenix.org
Data-intensive clusters and object stores are increasingly relying on in-memory object
caching to meet the I/O performance demands. These systems routinely face the challenges …

[PDF][PDF] {ChunkStash}: Speeding Up Inline Storage Deduplication Using Flash Memory

B Debnath, S Sengupta, J Li - 2010 USENIX Annual Technical …, 2010 - usenix.org
Storage deduplication has received recent interest in the research community. In scenarios
where the backup process has to complete within short time windows, inline deduplication …

A secure erasure code-based cloud storage system with secure data forwarding

HY Lin, WG Tzeng - IEEE transactions on parallel and …, 2011 - ieeexplore.ieee.org
A cloud storage system, consisting of a collection of storage servers, provides long-term
storage services over the Internet. Storing data in a third party's cloud system causes serious …

[PDF][PDF] Building a high-performance deduplication system

F Guo, P Efstathopoulos - 2011 USENIX Annual Technical Conference …, 2011 - usenix.org
Modern deduplication has become quite effective at eliminating duplicates in data, thus
multiplying the effective capacity of disk-based backup systems, and enabling them as …