The design and implementation of modern column-oriented database systems

D Abadi, P Boncz, S Harizopoulos… - … and Trends® in …, 2013 - nowpublishers.com
In this article, we survey recent research on column-oriented database systems, or column-
stores, where each attribute of a table is stored in a separate file or region on storage. Such …

Fusing similarity models with markov chains for sparse sequential recommendation

R He, J McAuley - 2016 IEEE 16th international conference on …, 2016 - ieeexplore.ieee.org
Predicting personalized sequential behavior is a key task for recommender systems. In order
to predict user actions such as the next product to purchase, movie to watch, or place to visit …

DB2 with BLU acceleration: So much more than just a column store

V Raman, G Attaluri, R Barber, N Chainani… - Proceedings of the …, 2013 - dl.acm.org
DB2 with BLU Acceleration deeply integrates innovative new techniques for defining and
processing column-organized tables that speed read-mostly Business Intelligence queries …

SIMD-scan: ultra fast in-memory table scan using on-chip vector processing units

T Willhalm, N Popovici, Y Boshmaf, H Plattner… - Proceedings of the …, 2009 - dl.acm.org
The availability of huge system memory, even on standard servers, generated a lot of
interest in main memory database engines. In data warehouse systems, highly compressed …

Bitweaving: Fast scans for main memory data processing

Y Li, JM Patel - Proceedings of the 2013 ACM SIGMOD International …, 2013 - dl.acm.org
This paper focuses on running scans in a main memory data processing system at" bare
metal" speed. Essentially, this means that the system must aim to process data at or near the …

Mison: a fast JSON parser for data analytics

Y Li, NR Katsipoulakis, B Chandramouli… - Proceedings of the …, 2017 - dl.acm.org
The growing popularity of the JSON format has fueled increased interest in loading and
processing JSON data within analytical data processing systems. However, in many …

Smoke: Fine-grained lineage at interactive speed

F Psallidas, E Wu - arXiv preprint arXiv:1801.07237, 2018 - arxiv.org
Data lineage describes the relationship between individual input and output data items of a
workflow, and has served as an integral ingredient for both traditional (eg, debugging …

BtrBlocks: efficient columnar compression for data lakes

M Kuschewski, D Sauerwein, A Alhomssi… - Proceedings of the ACM …, 2023 - dl.acm.org
Analytics is moving to the cloud and data is moving into data lakes. These reside on object
storage services like S3 and enable seamless data sharing and system interoperability. To …

Here are my data files. here are my queries. where are my results?

S Idreos, I Alagiannis, R Johnson… - Proceedings of 5th …, 2011 - infoscience.epfl.ch
Database management systems (DBMS) provide incredible flexibility and performance
when it comes to query processing, scalability and accuracy. To fully exploit DBMS features …

Database analytics acceleration using FPGAs

B Sukhwani, H Min, M Thoennes, P Dube… - Proceedings of the 21st …, 2012 - dl.acm.org
Business growth and technology advancements have resulted in growing amounts of
enterprise data. To gain valuable business insight and competitive advantage, businesses …