Scalable distributed stream join processing

Q Lin, BC Ooi, Z Wang, C Yu - Proceedings of the 2015 ACM SIGMOD …, 2015 - dl.acm.org
Efficient and scalable stream joins play an important role in performing real-time analytics for
many cloud applications. However, like in conventional database processing, online theta …

QuERy: A framework for integrating entity resolution with query processing

H Altwaijry, S Mehrotra, DV Kalashnikov - Proceedings of the VLDB …, 2015 - dl.acm.org
This paper explores an analysis-aware data cleaning architecture for a large class of SPJ
SQL queries. In particular, we propose QuERy, a novel framework for integrating entity …

Spatial online sampling and aggregation

L Wang, R Christensen, F Li, K Yi - Proceedings of the VLDB Endowment, 2015 - dl.acm.org
The massive adoption of smart phones and other mobile devices has generated a humongous
amount of spatial and spatio-temporal data. The importance of spatial analytics and …

Speculative approximations for terascale distributed gradient descent optimization

C Qin, F Rusu - Proceedings of the Fourth Workshop on Data analytics …, 2015 - dl.acm.org
Model calibration is a major challenge faced by the plethora of statistical analytics packages
that are increasingly used in Big Data applications. Identifying the optimal model parameters …

STORM: Spatio-temporal online reasoning and management of large spatio-temporal data

R Christensen, L Wang, F Li, K Yi, J Tang… - Proceedings of the 2015 …, 2015 - dl.acm.org
We present the STORM system to enable spatio-temporal online reasoning and
management of large spatio-temporal data. STORM supports interactive spatio-temporal …

Efficient processing of skyline-join queries over multiple data sources

M Nagendra, KS Candan - ACM Transactions on Database Systems …, 2015 - dl.acm.org
Efficient processing of skyline queries has been an area of growing interest. Many of the
earlier skyline techniques assumed that the skyline query is applied to a single data table …

Scalable Analytics Model Calibration with Online Aggregation

F Rusu, C Qin, M Torres - IEEE Data Eng. Bull., 2015 - faculty.ucmerced.edu
Model calibration is a major challenge faced by the plethora of statistical analytics
packages that are increasingly used in Big Data applications. Identifying the optimal model …

An efficient block sampling strategy for online aggregation in the cloud

X Ci, X Meng - Web-Age Information Management: 16th International …, 2015 - Springer
With the development of social networks, the mobile Internet, etc., an increasing amount of data is
being generated, which is beyond the processing ability of traditional data management tools …

Speculative approximations for terascale analytics

C Qin, F Rusu - arXiv preprint arXiv:1501.00255, 2015 - arxiv.org
Model calibration is a major challenge faced by the plethora of statistical analytics packages
that are increasingly used in Big Data applications. Identifying the optimal model parameters …

Progressive online aggregation in a distributed stream system

D Yang, J Cao, S Wu, J Wang - Journal of Systems and Software, 2015 - Elsevier
Interactive query processing aims at generating approximate results with minimum response
time. However, it is quite difficult for a batch-oriented processing system to progressively …