Triton join: Efficiently scaling to a large join state on GPUs with fast interconnects

C Lutz, S Breß, S Zeuch, T Rabl, V Markl - Proceedings of the 2022 …, 2022 - dl.acm.org
Database management systems are facing growing data volumes. Previous research
suggests that GPUs are well-equipped to quickly process joins and similar stateful …

Is FPGA useful for hash joins?

X Chen, Y Chen, R Bajaj, J He, B He, WF Wong… - CIDR, 2020 - comp.nus.edu.sg
Benefiting from the fine-grained parallelism and energy efficiency, heterogeneous
computing platforms featuring FPGAs are becoming more and more common in data …

BriskStream: Scaling data stream processing on shared-memory multicore architectures

S Zhang, J He, AC Zhou, B He - … of the 2019 International Conference on …, 2019 - dl.acm.org
We introduce BriskStream, an in-memory data stream processing system (DSPS)
specifically designed for modern shared-memory multicore architectures. BriskStream's key …

Accelerating database systems using FPGAs: A survey

P Papaphilippou, W Luk - 2018 28th International Conference …, 2018 - ieeexplore.ieee.org
Database systems are key to a variety of applications, and FPGA-based accelerators have
shown promise in supporting high-performance database systems. This survey presents a …

VIP: A SIMD vectorized analytical query engine

O Polychroniou, KA Ross - The VLDB Journal, 2020 - Springer
Query execution engines for analytics are continuously adapting to the underlying hardware
in order to maximize performance. Wider SIMD registers and more complex SIMD instruction …

StreamBox-HBM: Stream analytics on high bandwidth hybrid memory

H Miao, M Jeon, G Pekhimenko, KS McKinley… - Proceedings of the …, 2019 - dl.acm.org
Stream analytics has an insatiable demand for memory and performance. Emerging hybrid
memories combine commodity DDR4 DRAM with 3D-stacked High Bandwidth Memory …

Towards practical vectorized analytical query engines

O Polychroniou, KA Ross - … of the 15th International Workshop on Data …, 2019 - dl.acm.org
Query execution engines are adapting to the underlying hardware in order to maximize
performance. Wider SIMD registers and more complex SIMD instruction sets are emerging in …

Interleaved multi-vectorizing

Z Fang, B Zheng, C Weng - Proceedings of the VLDB Endowment, 2019 - dl.acm.org
SIMD is an instruction set in mainstream processors, which provides the data level
parallelism to accelerate the performance of applications. However, its advantages diminish …

Accelerating data filtering for database using FPGA

X Sun, CJ Xue, J Yu, TW Kuo, X Liu - Journal of Systems Architecture, 2021 - Elsevier
In the big data era, in order to relieve computational pressure on overloaded CPUs caused by
the ever-increasing amount of data, much research focuses on hardware acceleration using …

Joins on high-bandwidth memory: a new level in the memory hierarchy

C Pohl, KU Sattler, G Graefe - The VLDB Journal, 2020 - Springer
High-bandwidth memory (HBM) gives an additional opportunity for hardware performance
benefits. The high available bandwidth compared to regular DRAM allows execution of …