F Rusu, A Dobra - ACM Transactions on Database Systems (TODS), 2008 - dl.acm.org
Sketching techniques provide approximate answers to aggregate queries both for data-streaming and distributed computation. Small space summaries that have linearity …
Peer Data Management Systems (PDMSs) are a novel, useful, but challenging paradigm for distributed data management and query processing. Conventional integrated information …
S Joshi, C Jermaine - IEEE Transactions on Knowledge and …, 2008 - ieeexplore.ieee.org
We consider the problem of creating a sample view of a database table. A sample view is an indexed materialized view that permits efficient sampling from an arbitrary range query over …
Perhaps the most flexible synopsis of a database is a uniform random sample of the data; such samples are widely used to speed up the processing of analytic queries and data …
F Xu, C Jermaine, A Dobra - ACM Transactions on Database Systems …, 2008 - dl.acm.org
Sampling is now a very important data management tool, to such an extent that an interface for database sampling is included in the latest SQL standard. In this article we reconsider in …
We demonstrate our prototype of the DBO database system. DBO is designed to facilitate scalable analytic processing over large data archives. DBO's analytic processing …
Random sampling is one of the most fundamental data management tools available. However, most current research involving sampling considers the problem of how to use a …
Aggregation queries are performed by first identifying outlier values, aggregating the outlier values, and sampling the remaining data after pruning the outlier values. The sampled data …
Stream-based data management enables the efficient analysis and processing of large volumes of data in distributed environments. This thesis presents network-aware …