Implementing sorting in database systems

G Graefe - ACM Computing Surveys (CSUR), 2006 - dl.acm.org
Most commercial database systems do (or should) exploit many sorting techniques that are
publicly known, but not readily available in the research literature. These techniques …

MapReduce: simplified data processing on large clusters

J Dean, S Ghemawat - Communications of the ACM, 2008 - dl.acm.org
MapReduce is a programming model and an associated implementation for processing and
generating large datasets that is amenable to a broad variety of real-world tasks. Users …

MapReduce: Simplified data processing on large clusters

J Dean, S Ghemawat - 2004 - usenix.org
MapReduce is a programming model and an associated implementation for processing and
generating large data sets. Users specify a _map_ function that processes a key/value pair …

Map-reduce-merge: simplified relational data processing on large clusters

H Yang, A Dasdan, RL Hsiao, DS Parker - Proceedings of the 2007 ACM …, 2007 - dl.acm.org
Map-Reduce is a programming model that enables easy development of scalable parallel
applications to process a vast amount of data on large clusters of commodity machines …

Eddies: Continuously adaptive query processing

R Avnur, JM Hellerstein - Proceedings of the 2000 ACM SIGMOD …, 2000 - dl.acm.org
In large federated and shared-nothing databases, resources can exhibit widely fluctuating
characteristics. Assumptions made at the time a query is submitted will rarely hold …

Tarazu: optimizing mapreduce on heterogeneous clusters

F Ahmad, ST Chakradhar, A Raghunathan… - ACM SIGARCH …, 2012 - dl.acm.org
Data center-scale clusters are evolving towards heterogeneous hardware for power, cost,
differentiated price-performance, and other reasons. MapReduce is a well-known …

DBMSs on a modern processor: Where does time go?

A Ailamaki, DJ DeWitt, MD Hill… - VLDB'99, Proceedings of …, 1999 - infoscience.epfl.ch
Recent high-performance processors employ sophisticated techniques to overlap and
simultaneously execute multiple computation and memory operations. Intuitively, these …

Active disks: Programming model, algorithms and evaluation

A Acharya, M Uysal, J Saltz - ACM SIGOPS Operating Systems Review, 1998 - dl.acm.org
Several application and technology trends indicate that it might be both profitable and
feasible to move computation closer to the data that it processes. In this paper, we evaluate …

GPUTeraSort: high performance graphics co-processor sorting for large database management

N Govindaraju, J Gray, R Kumar… - Proceedings of the 2006 …, 2006 - dl.acm.org
We present a novel external sorting algorithm using graphics processors (GPUs) on large
databases composed of billions of records and wide keys. Our algorithm uses the data …

Active storage for large-scale data mining and multimedia applications

E Riedel, G Gibson, C Faloutsos - … of 24th Conference on Very Large …, 1998 - Citeseer
The increasing performance and decreasing cost of processors and memory are causing
system intelligence to move into peripherals from the CPU. Storage system designers are …