BlinkDB: queries with bounded errors and bounded response times on very large data

S Agarwal, B Mozafari, A Panda, H Milner… - Proceedings of the 8th …, 2013 - dl.acm.org
In this paper, we present BlinkDB, a massively parallel, approximate query engine for
running interactive SQL queries on large volumes of data. BlinkDB allows users to trade-off …

[PDF][PDF] 云数据管理系统中查询技术研究综述

史英杰, 孟小峰 - 计算机学报, 2013 - cjc.ict.ac.cn
摘要作为一种全新的互联网应用模式, 云计算在工业界和学术界备受关注.
人们可以通过终端设备便捷地获取云端服务, 并以按需使用的方式获得存储资源 …

Dynamic reduction of query result sets for interactive visualizaton

L Battle, M Stonebraker, R Chang - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
Modern database management systems (DBMS) have been designed to efficiently store,
manage and perform computations on massive amounts of data. In contrast, many existing …

Scalable progressive analytics on big data in the cloud

B Chandramouli, J Goldstein, A Quamar - Proceedings of the VLDB …, 2013 - dl.acm.org
Analytics over the increasing quantity of data stored in the Cloud has become very
expensive, particularly due to the pay-as-you-go Cloud computation model. Data scientists …

A sampling algebra for aggregate estimation

S Nirkhiwale, A Dobra, C Jermaine - arXiv preprint arXiv:1307.0193, 2013 - arxiv.org
As of 2005, sampling has been incorporated in all major database systems. While efficient
sampling techniques are realizable, determining the accuracy of an estimate obtained from …

Partition-based online aggregation with shared sampling in the cloud

YX Wang, JZ Luo, AB Song, F Dong - Journal of computer science and …, 2013 - Springer
Online aggregation is an attractive sampling-based technology to response aggregation
queries by an estimate to the final result, with the confidence interval becoming tighter over …

[PDF][PDF] A native and adaptive approach for linked stream data processing

D Le Phuoc - Doctoraatsthesis, NUI Galway, 2013 - aran.library.nuigalway.ie
Sensors, mobile devices and social platforms generate an immense amount of stream data
in various formats and schemata. For these areas, the idea of Linked Stream Data is to …

Processing online aggregation on skewed data in mapreduce

Y Gan, X Meng, Y Shi - Proceedings of the fifth international workshop …, 2013 - dl.acm.org
In online aggregation, a system constantly maintains an estimate of the final answer to an
aggregate query throughout execution, along with statistically meaningful bounds for the …

[PDF][PDF] Implementation of Skyline Sweeping Algorithm

B Veerendra, BV Reddy - International Journal of Computer Science & …, 2013 - ijcset.com
Searching keywords in databases is complex task than search in files. Information Retrieval
(IR) process search keywords from text files and it is very important that queering keyword to …

Histograms as statistical estimators for aggregate queries

L Chen, A Dobra - Information Systems, 2013 - Elsevier
The traditional statistical assumption for interpreting histograms and justifying approximate
query processing methods based on them is that all elements in a bucket have the same …