Data deduplication techniques for efficient cloud storage management: a systematic review

R Kaur, I Chana, J Bhattacharya - The Journal of Supercomputing, 2018 - Springer
The exponential growth of digital data in cloud storage systems is a critical issue presently
as a large amount of duplicate data in the storage systems exerts an extra load on it …

An empirical case study on the temporary file smell in dockerfiles

Z Lu, J Xu, Y Wu, T Wang, T Huang - IEEE Access, 2019 - ieeexplore.ieee.org
Docker is widely used in data centers to host services. The docker image adopts a
hierarchical storage architecture, which means that the docker image is composed of a set of …

An empirical analysis of vm startup times in public iaas clouds

J Hao, T Jiang, W Wang, IK Kim - 2021 IEEE 14th International …, 2021 - ieeexplore.ieee.org
VM startup time is an essential factor in designing elastic cloud applications. VM autoscaling
can reduce the under-/over-provisioning period of VMs with a precise estimation of VM …

Dispatching‐Rule Variants Algorithms for Used Spaces of Storage Supports

H Alquhayz, M Jemmali… - Discrete Dynamics in …, 2020 - Wiley Online Library
The paper is regarding the fair distribution of several files having different sizes to several
storage supports. With the existence of several storage supports and different files, we …

Dockerfile tf smell detection based on dynamic and static analysis methods

J Xu, Y Wu, Z Lu, T Wang - 2019 ieee 43rd annual computer …, 2019 - ieeexplore.ieee.org
Dockerfile is used to build docker image. In the image building process, temporary files are
frequently used to import applications and data. A careless use of Dockerfile may cause …

SecDedoop: secure deduplication with access control of big data in the HDFS/hadoop environment

P Ramya, C Sundar - Big Data, 2020 - liebertpub.com
With the rapid growth of storage providers, data deduplication is an essential storage
optimization technique that greatly minimizes data storage costs by storing a unique copy of …

Resemblance and mergence based indexing for high performance data deduplication

P Zhang, P Huang, X He, H Wang, K Zhou - Journal of Systems and …, 2017 - Elsevier
Data deduplication, a data redundancy elimination technique, has been widely employed in
many application environments to reduce data storage space. However, it is challenging to …

Secure client-side deduplication scheme for cloud with dual trusted execution environment

G Verma - IETE Journal of Research, 2023 - Taylor & Francis
Data deduplication is the process that eliminates excessive redundant data in a dataset to
improve storage and bandwidth utilization by significantly reducing the storage costs and …

An empirical analysis of VM startup times in public iaas clouds: An extended report

J Hao, T Jiang, W Wang, IK Kim - arXiv preprint arXiv:2107.03467, 2021 - arxiv.org
VM startup time is an essential factor in designing elastic cloud applications. For example, a
cloud application with autoscaling can reduce under-and over-provisioning of VM instances …

Parallel materialization of datalog programs with spark for scalable reasoning

H Wu, J Liu, T Wang, D Ye, J Wei, H Zhong - Web Information Systems …, 2016 - Springer
As the volume of semantic data increases rapidly, semantic reasoning becomes a very
challenging task. Existing scalable reasoners focus on fragments of OWL 2 RL (eg. RDFS …