Distributed data management using MapReduce

F Li, BC Ooi, MT Özsu, S Wu - ACM Computing Surveys (CSUR), 2014 - dl.acm.org
MapReduce is a framework for processing and managing large-scale datasets in a
distributed cluster, which has been used for applications such as generating search indexes …

Hybrid query optimization for hard-to-compress bit-vectors

G Guzun, G Canahuate - The VLDB Journal, 2016 - Springer
Bit-vectors are widely used for indexing and summarizing data due to their efficient
processing in modern computers. Sparse bit-vectors can be further compressed to reduce …

[HTML][HTML] Dynamic multidimensional index for large-scale cloud data

J He, Y Wu, Y Dong, Y Zhang, W Zhou - Journal of Cloud Computing, 2016 - Springer
Although several cloud storage systems have been proposed, most of them can provide
highly efficient point queries only because of the key-value pairs storing mechanism. For …

CORES: towards scan-optimized columnar storage for nested records

W Wen, Y Li, W Li, L Deng, Y He - ACM Transactions on Storage (TOS), 2019 - dl.acm.org
The relatively high cost of record deserialization is increasingly becoming the bottleneck of
column-based storage systems in tree-structured applications [58]. Due to record …

Hm: A column-oriented mapreduce system on hybrid storage

S Wu, G Chen, K Chen, F Li… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
The solid-state hybrid drive (SSHD) incorporates a small NAND flash memory into a hard
drive, resulting in an integrated device with combined Hard Disk Drive (HDD) and Solid …

Accelerating Columnar Storage Based on Asynchronous Skipping Strategy

W Li, Z Yang, L Deng, Z Cheng, W Wen, Y He - Big Data Research, 2023 - Elsevier
Many database applications, such as OnLine Analytical Processing (OLAP), web-based
information extraction or scientific computation, need to select a subset of fields based on …

SLC-index: A scalable skip list-based index for cloud data processing

J He, S Yao, L Cai, W Zhou - Journal of Central South University, 2018 - Springer
Due to the increasing number of cloud applications, the amount of data in the cloud shows
signs of growing faster than ever before. The nature of cloud computing requires cloud data …

Handling multi-dimensional complex queries in key-value data stores

H Sun, Y Tang, Q Wang, X Liu - Information Systems, 2017 - Elsevier
With the advent of the era of cloud computing and big data, in order to cope with vast
amounts of data, a number of key-value databases have emerged. These systems provide …

Skipnet-octree based indexing technique for cloud database management system

S Malhotra, MN Doja, B Alam, M Alam - International Journal of …, 2018 - igi-global.com
This article describes how data indexing plays a very crucial role in query processing.
Systems based on traditional indexes like B-tree, R-tree, Bitmap, inverted indexing …

Research on index mechanism of hbase based on coprocessor for sensor data

F Ye, S Zhu, Y Lou, Z Liu, Y Chen… - 2019 IEEE 43rd Annual …, 2019 - ieeexplore.ieee.org
In order to provide effective management of big data, almost two hundred different NoSQL
stores have been developed, among which HBase is one of the best known. When …