[PDF][PDF] MonetDB: Two decades of research in column-oriented database architectures

SIFGN Nes, SMSMM Kersten - Data Engineering, 2012 - Citeseer
MonetDB is a state-of-the-art open-source column-store database management system
targeting applications in need for analytics over large collections of data. MonetDB is …

The design and implementation of modern column-oriented database systems

D Abadi, P Boncz, S Harizopoulos… - … and Trends® in …, 2013 - nowpublishers.com
In this article, we survey recent research on column-oriented database systems, or column-
stores, where each attribute of a table is stored in a separate file or region on storage. Such …

A modern primer on processing in memory

O Mutlu, S Ghose, J Gómez-Luna… - … computing: from devices …, 2022 - Springer
Modern computing systems are overwhelmingly designed to move data to computation. This
design choice goes directly against at least three key trends in computing that cause …

Fast and sensitive protein alignment using DIAMOND

B Buchfink, C Xie, DH Huson - Nature methods, 2015 - nature.com
The alignment of sequencing reads against a protein reference database is a major
computational bottleneck in metagenomics and data-intensive evolutionary projects …

Shuffling, fast and slow: Scalable analytics on serverless infrastructure

Q Pu, S Venkataraman, I Stoica - 16th USENIX symposium on networked …, 2019 - usenix.org
Serverless computing is poised to fulfill the long-held promise of transparent elasticity and
millisecond-level pricing. To achieve this goal, service providers impose a finegrained …

Processing data where it makes sense: Enabling in-memory computation

O Mutlu, S Ghose, J Gómez-Luna… - Microprocessors and …, 2019 - Elsevier
Today's systems are overwhelmingly designed to move data to computation. This design
choice goes directly against at least three key trends in systems that cause performance …

Photon: A fast query engine for lakehouse systems

A Behm, S Palkar, U Agarwal, T Armstrong… - Proceedings of the …, 2022 - dl.acm.org
Many organizations are shifting to a data management paradigm called the" Lakehouse,"
which implements the functionality of structured data warehouses on top of unstructured …

CHARM: An efficient algorithm for closed itemset mining

MJ Zaki, CJ Hsiao - Proceedings of the 2002 SIAM international conference …, 2002 - SIAM
The set of frequent closed itemsets uniquely determines the exact frequency of all itemsets,
yet it can be orders of magnitude smaller than the set of all frequent itemsets. In this paper …

Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware

C Balkesen, J Teubner, G Alonso… - 2013 IEEE 29th …, 2013 - ieeexplore.ieee.org
The architectural changes introduced with multi-core CPUs have triggered a redesign of
main-memory join algorithms. In the last few years, two diverging views have appeared. One …

Integrating compression and execution in column-oriented database systems

D Abadi, S Madden, M Ferreira - Proceedings of the 2006 ACM SIGMOD …, 2006 - dl.acm.org
Column-oriented database system architectures invite a re-evaluation of how and when data
in databases is compressed. Storing data in a column-oriented fashion greatly increases the …