ComMapReduce: An improvement of MapReduce with lightweight communication mechanisms

L Ding, G Wang, J Xin, X Wang, S Huang… - Data & Knowledge …, 2013 - Elsevier
As a parallel programming framework, MapReduce can process scalable and parallel
applications with large-scale datasets. The executions of Mappers and Reducers are …

Memory-efficient groupby-aggregate using compressed buffer trees

H Amur, W Richter, DG Andersen, M Kaminsky… - Proceedings of the 4th …, 2013 - dl.acm.org
The rapid growth of fast analytics systems that require data processing in memory makes
memory capacity an increasingly-precious resource. This paper introduces a new …

Evolutionary multiobjective query workload optimization of Cloud data warehouses

T Dokeroglu, SA Sert, MS Cinar - The Scientific World Journal, 2014 - Wiley Online Library
With the advent of Cloud databases, query optimizers need to find Pareto-optimal solutions in
terms of response time and monetary cost. Our novel approach minimizes both objectives by …

Partition-based online aggregation with shared sampling in the cloud

YX Wang, JZ Luo, AB Song, F Dong - Journal of computer science and …, 2013 - Springer
Online aggregation is an attractive sampling-based technology to respond to aggregation
queries by an estimate to the final result, with the confidence interval becoming tighter over …

Distributed outlier detection using compressive sensing

Y Yan, J Zhang, B Huang, X Sun, J Mu… - Proceedings of the …, 2015 - dl.acm.org
Computing outliers and related statistical aggregation functions from large-scale big data
sources is a critical operation in many cloud computing scenarios, e.g. service quality …

Workload profiling

SD Dolas, PD Codding - US Patent 10,356,167, 2019 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on computer
storage media, for profiling and configuring work on a cluster of computer nodes. One …

Scalable distributed data cube computation for large-scale multidimensional data analysis on a Spark cluster

S Lee, S Kang, J Kim, EJ Yu - Cluster Computing, 2019 - Springer
A data cube is a powerful analytical tool that stores all aggregate values over a set of
dimensions. It provides users with a simple and efficient means of performing complex data …

SocialCloud: Using social networks for building distributed computing services

A Mohaisen, H Tran, A Chandra, Y Kim - arXiv preprint arXiv:1112.2254, 2011 - arxiv.org
In this paper we investigate a new computing paradigm, called SocialCloud, in which
computing nodes are governed by social ties driven from a bootstrapping trust-possessing …

Concurrency optimized task scheduling for workflows in cloud

Y Gao, H Ma, H Zhang, X Kong… - 2013 IEEE Sixth …, 2013 - ieeexplore.ieee.org
In recent years, more and more enterprises have migrated their applications to the cloud for cost
saving. During the application migration, an application usually needs to be re-designed …

System and method for distributed database query engines

R Murthy, R Goel - US Patent 10,210,221, 2019 - Google Patents
Techniques for a system capable of performing low-latency database query processing are
disclosed herein. The system includes a gateway server and a plurality of worker nodes. The …