Application-aware big data deduplication in cloud environment

Y Fu, N Xiao, H Jiang, G Hu… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
Deduplication has become a widely deployed technology in cloud data centers to improve IT
resources efficiency. However, traditional techniques face a great challenge in big data …

Application-aware local-global source deduplication for cloud backup services of personal storage

Y Fu, H Jiang, N Xiao, L Tian, F Liu… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
In personal computing devices that rely on a cloud storage environment for data backup, an
imminent challenge facing source deduplication for cloud backup services is the low …

Accelerating content-defined-chunking based data deduplication by exploiting parallelism

W Xia, D Feng, H Jiang, Y Zhang, V Chang… - Future Generation …, 2019 - Elsevier
Data deduplication, a data reduction technique that efficiently detects and eliminates
redundant data chunks and files, has been widely applied in large-scale storage systems …

Efficient hybrid inline and out-of-line deduplication for backup storage

YK Li, M Xu, CH Ng, PPC Lee - ACM Transactions on Storage (TOS), 2014 - dl.acm.org
Backup storage systems often remove redundancy across backups via inline deduplication,
which works by referring duplicate chunks of the latest backup to those of existing backups …

Boafft: Distributed deduplication for big data storage in the cloud

S Luo, G Zhang, C Wu, SU Khan… - IEEE transactions on …, 2015 - ieeexplore.ieee.org
As data progressively grows within data centers, the cloud storage systems continuously
facechallenges in saving storage capacity and providing capabilities necessary to move big …

Ef-dedup: Enabling collaborative data deduplication at the network edge

S Li, T Lan, B Balasubramanian, MR Ra… - 2019 IEEE 39th …, 2019 - ieeexplore.ieee.org
The advent of IoT and edge computing will lead to massive amounts of data that need to be
collected and transmitted to online storage systems. To address this problem, we push data …

SAM: A semantic-aware multi-tiered source de-duplication framework for cloud backup

Y Tan, H Jiang, D Feng, L Tian, Z Yan… - … on Parallel Processing, 2010 - ieeexplore.ieee.org
Existing de-duplication solutions in cloud backup environment either obtain high
compression ratios at the cost of heavy de-duplication overheads in terms of increased …

EAD: elasticity aware deduplication manager for datacenters with multi-tier storage systems

Z Yang, Y Wang, J Bhamini, CC Tan, N Mi - Cluster Computing, 2018 - Springer
Abstract The popularity of Big Data applications places pressures on storage systems to
efficiently scale to meet the demand. At the same time, new developments like solid-state …

A scalable inline cluster deduplication framework for big data protection

Y Fu, H Jiang, N Xiao - Middleware 2012: ACM/IFIP/USENIX 13th …, 2012 - Springer
Cluster deduplication has become a widely deployed technology in data protection services
for Big Data to satisfy the requirements of service level agreement (SLA). However, it …

Hpdedup: A hybrid prioritized data deduplication mechanism for primary storage in the cloud

H Wu, C Wang, Y Fu, S Sakr, L Zhu, K Lu - arXiv preprint arXiv:1702.08153, 2017 - arxiv.org
Eliminating duplicate data in primary storage of clouds increases the cost-efficiency of cloud
service providers as well as reduces the cost of users for using cloud services. Existing …