Query processing on heterogeneous CPU/GPU systems

V Rosenfeld, S Breß, V Markl - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
Due to their high computational power and internal memory bandwidth, graphic processing
units (GPUs) have been extensively studied by the database systems research community …

Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age

V Leis, P Boncz, A Kemper, T Neumann - Proceedings of the 2014 ACM …, 2014 - dl.acm.org
With modern computer architecture evolving, two problems conspire against the state-of-the-
art approaches in parallel query execution:(i) to take advantage of many-cores, all query …

Column-stores vs. row-stores: how different are they really?

DJ Abadi, SR Madden, N Hachem - Proceedings of the 2008 ACM …, 2008 - dl.acm.org
There has been a significant amount of excitement and recent work on column-oriented
database systems (" column-stores"). These database systems have been shown to perform …

Towards scalable dataframe systems

D Petersohn, S Macke, D Xin, W Ma, D Lee… - arXiv preprint arXiv …, 2020 - arxiv.org
Dataframes are a popular abstraction to represent, prepare, and analyze data. Despite the
remarkable success of dataframe libraries in Rand Python, dataframes face performance …

[PDF][PDF] Database Cracking.

S Idreos, ML Kersten, S Manegold - CIDR, 2007 - people.eecs.berkeley.edu
Database indices provide a non-discriminative navigational infrastructure to localize tuples
of interest. Their maintenance cost is taken during database updates. In this paper, we study …

MRShare: sharing across multiple queries in MapReduce

T Nykiel, M Potamias, C Mishra, G Kollios… - Proceedings of the …, 2010 - dl.acm.org
Large-scale data analysis lies in the core of modern enterprises and scientific research. With
the emergence of cloud computing, the use of an analytical query processing infrastructure …

Major technical advancements in apache hive

Y Huai, A Chauhan, A Gates, G Hagleitner… - Proceedings of the …, 2014 - dl.acm.org
Apache Hive is a widely used data warehouse system for Apache Hadoop, and has been
adopted by many organizations for various big data analytics applications. Closely working …

Data-oriented transaction execution

I Pandis, R Johnson, N Hardavellas… - Proceedings of the …, 2010 - infoscience.epfl.ch
While hardware technology has undergone major advancements over the past decade,
transaction processing systems have remained largely unchanged. The number of cores on …

Sorting networks on FPGAs

R Mueller, J Teubner, G Alonso - The VLDB Journal, 2012 - Springer
Computer architectures are quickly changing toward heterogeneous many-core systems.
Such a trend opens up interesting opportunities but also raises immense challenges since …

Ysmart: Yet another sql-to-mapreduce translator

R Lee, T Luo, Y Huai, F Wang, Y He… - 2011 31st International …, 2011 - ieeexplore.ieee.org
MapReduce has become an effective approach to big data analytics in large cluster
systems, where SQL-like queries play important roles to interface between users and …