BlinkDB: queries with bounded errors and bounded response times on very large data

S Agarwal, B Mozafari, A Panda, H Milner… - Proceedings of the 8th …, 2013 - dl.acm.org
In this paper, we present BlinkDB, a massively parallel, approximate query engine for
running interactive SQL queries on large volumes of data. BlinkDB allows users to trade-off …

Consistency rationing in the cloud: Pay only when it matters

T Kraska, M Hentschel, G Alonso… - Proceedings of the VLDB …, 2009 - dl.acm.org
Cloud storage solutions promise high scalability and low cost. Existing solutions, however,
differ in the degree of consistency they provide. Our experience using such systems …

Ycsb++ benchmarking and performance debugging advanced features in scalable table stores

S Patil, M Polte, K Ren, W Tantisiriroj, L Xiao… - Proceedings of the 2nd …, 2011 - dl.acm.org
Inspired by Google's BigTable, a variety of scalable, semi-structured, weak-semantic table
stores have been developed and optimized for different priorities such as query speed …

Transaction chains: achieving serializability with low latency in geo-distributed storage systems

Y Zhang, R Power, S Zhou, Y Sovran… - Proceedings of the …, 2013 - dl.acm.org
Currently, users of geo-distributed storage systems face a hard choice between having
serializable transactions with high latency, or limited or no transactions with low latency. We …

How is the weather tomorrow? Towards a benchmark for the cloud

C Binnig, D Kossmann, T Kraska… - Proceedings of the Second …, 2009 - dl.acm.org
Traditionally, the goal of benchmarking a software system is to evaluate its performance
under a particular workload for a fixed configuration. The most prominent examples for …

The power of choice in {Data-Aware} cluster scheduling

S Venkataraman, A Panda… - … USENIX Symposium on …, 2014 - usenix.org
Providing timely results in the face of rapid growth in data volumes has become important for
analytical frameworks. For this reason, frameworks increasingly operate on only a subset of …

Asynchronous view maintenance for VLSD databases

P Agrawal, A Silberstein, BF Cooper… - Proceedings of the …, 2009 - dl.acm.org
The query models of the recent generation of very large scale distributed (VLSD) shared-
nothing data storage systems, including our own PNUTS and others (eg BigTable, Dynamo …

A hierarchical approach to model web query interfaces for web source integration

EC Dragut, T Kabisch, C Yu, U Leser - Proceedings of the VLDB …, 2009 - dl.acm.org
Much data in the Web is hidden behind Web query interfaces. In most cases the only means
to" surface" the content of a Web database is by formulating complex queries on such …

Feeding frenzy: selectively materializing users' event feeds

A Silberstein, J Terrace, BF Cooper… - Proceedings of the 2010 …, 2010 - dl.acm.org
Near real-time event streams are becoming a key feature of many popular web applications.
Many web sites allow users to create a personalized feed by selecting one or more event …

Continuum: A platform for cost-aware, low-latency continual learning

H Tian, M Yu, W Wang - Proceedings of the ACM Symposium on Cloud …, 2018 - dl.acm.org
Many machine learning applications operate in dynamic environments that change over
time, in which models must be continually updated to capture the recent trend in data …