Research directions for Principles of Data Management (Dagstuhl perspectives workshop 16151)

S Abiteboul, M Arenas, P Barceló, M Bienvenu… - arXiv preprint arXiv …, 2017 - arxiv.org
In April 2016, a community of researchers working in the area of Principles of Data
Management (PDM) joined in a workshop at the Dagstuhl Castle in Germany. The workshop …

[图书][B] Supporting Interactive Analytics and Visualization on Large Data

J Jia - 2017 - search.proquest.com
There is an increasing demand to visualize large datasets as human observable reports in
order to quickly draw insights and gain timely awareness from the data. An interactive user …

OLA-RAW: scalable exploration over raw data

Y Cheng, W Zhao, F Rusu - arXiv preprint arXiv:1702.00358, 2017 - arxiv.org
In-situ processing has been proposed as a novel data exploration solution in many domains
generating massive amounts of raw data, eg, astronomy, since it provides immediate SQL …

Scalable hash ripple join on spark

H Liu, J Xiao, F Peng - 2017 IEEE 23rd International …, 2017 - ieeexplore.ieee.org
Hash Ripple join is an online aggregation algorithm that can rapidly give good approximate
join results increases with the progress of the join operation and converges to the real result …

Efficient Online Processing for Advanced Analytics

MEMA El Seidy - 2017 - infoscience.epfl.ch
With the advent of emerging technologies and the Internet of Things, the importance of
online data analytics has become more pronounced. Businesses and companies are …

[PDF][PDF] Approximate Data Analytics Systems

C Fetzer - 2017 - core.ac.uk
Today, most modern online services make use of big data analytics systems to extract useful
information from the raw digital data. The data normally arrives as a continuous data stream …

The Power of Distance Distributions: Cost Models and Scheduling Policies for Quality-Controlled Similarity Queries

P Ciaccia, M Patella - International Conference on Similarity Search and …, 2017 - Springer
Approximate similarity queries are a practical way to obtain good, yet suboptimal, results
from large data sets without having to pay high execution costs. In this paper we analyze the …

Technical perspective: Optimized wandering for online aggregation

JF Naughton - ACM SIGMOD Record, 2017 - dl.acm.org
There is a rich history in the DBMS research literature involving sampling to estimate the
results of queries faster than they can be computed exactly. A particularly interesting …

Logic-partition based gaussian sampling for online aggregation

L Zhang, Y Wang, X Xu - … on Advanced Cloud and Big Data …, 2017 - ieeexplore.ieee.org
Online aggregation is a commonly used technology to return approximate query results over
random samples, which provides a fast way for users to obtain a trade-off between time and …

Convergent Interactive Inference with Leaky Joins

Y Yang, O Kennedy - Proceedings of the 20th International Conference …, 2017 - par.nsf.gov
One of the primary challenges in graphical models is inference, or re-constructing a
marginal probability from the graphical model's factorized representation. While tractable for …