Farview: Disaggregated memory with operator off-loading for database engines

D Korolija, D Koutsoukos, K Keeton, K Taranov… - arXiv preprint arXiv …, 2021 - arxiv.org
Cloud deployments disaggregate storage from compute, providing more flexibility to both the
storage and compute layers. In this paper, we explore disaggregation by taking it one step …

Good to the last bit: Data-driven encoding with codecdb

H Jiang, C Liu, J Paparrizos, AA Chien, J Ma… - Proceedings of the …, 2021 - dl.acm.org
Columnar databases rely on specialized encoding schemes to reduce storage
requirements. These encodings also enable efficient in-situ data processing. Nevertheless …

Reimagining codesign for advanced scientific computing: Report for the ascr workshop on reimagining codesign

J Ang, AA Chien, SD Hammond, A Hoisie, I Karlin… - 2022 - osti.gov
In March 2021, the US Department of Energy's Advanced Scientific Computing Research
program convened the Workshop on Reimagining Codesign. The workshop, also known as …

Tackling Hardware/Software co-design from a database perspective

G Alonso, T Roscoe, D Cock… - 10th Annual …, 2020 - research-collection.ethz.ch
Hardware is evolving at a very fast pace due to diverse trends in the IT industry. In the area
of data processing, it is fair to say that software often just reacts to these changes, trying to …

Data processing with fpgas on modern architectures

W Jiang, D Korolija, G Alonso - … of the 2023 International Conference on …, 2023 - dl.acm.org
Trends in hardware, the prevalence of the cloud, and the rise of highly demanding
applications have ushered an era of specialization that is quickly changing the way data is …

CXL and the Return of Scale-Up Database Engines

A Lerner, G Alonso - arXiv preprint arXiv:2401.01150, 2024 - arxiv.org
The growing trend towards specialization has led to a proliferation of accelerators and
alternative processing devices. When embedded in conventional computer architectures …

Raw Filtering of JSON data on FPGAs

T Hahn, A Becher, S Wildermann… - … Design, Automation & …, 2022 - ieeexplore.ieee.org
Many Big Data applications include the processing of data streams on semi-structured data
formats such as JSON. A disadvantage of such formats is that an application may spend a …

The collection Virtual Machine: an abstraction for multi-frontend multi-backend data analysis

I Müller, R Marroquín, D Koutsoukos… - Proceedings of the 16th …, 2020 - dl.acm.org
Getting the best performance from the ever-increasing number of hardware platforms has
been a recurring challenge for data processing systems. In recent years, the advent of data …

Modularis: Modular relational analytics over heterogeneous distributed platforms

D Koutsoukos, I Müller, R Marroquín, A Klimovic… - arXiv preprint arXiv …, 2020 - arxiv.org
The enormous quantity of data produced every day together with advances in data analytics
has led to a proliferation of data management and analysis systems. Typically, these …

Secure machine learning using shared data in a distributed database

MJ Holboke, J Langseth, S Ozer… - US Patent …, 2022 - Google Patents
(57) ABSTRACT A secure machine learning system of a database system can be
implemented to use secure shared data to train a machine learning model. To manage the …