Bao: Making learned query optimization practical

R Marcus, P Negi, H Mao, N Tatbul… - Proceedings of the …, 2021 - dl.acm.org
Recent efforts applying machine learning techniques to query optimization have shown few
practical gains due to substantive training overhead, inability to adapt to changes, and poor …

Data modeling in the NoSQL world

P Atzeni, F Bugiotti, L Cabibbo, R Torlone - Computer Standards & …, 2020 - Elsevier
NoSQL systems have gained their popularity for many reasons, including the flexibility they
provide in organizing data, as they relax the rigidity provided by the relational model and by …

Debugging database queries: A survey of tools, techniques, and users

S Gathani, P Lim, L Battle - Proceedings of the 2020 CHI Conference on …, 2020 - dl.acm.org
Database management systems (or DBMSs) have been around for decades, and yet are still
difficult to use, particularly when trying to identify and fix errors in user programs (or queries) …

Get real: How benchmarks fail to represent the real world

A Vogelsgesang, M Haubenschild, J Finis… - Proceedings of the …, 2018 - dl.acm.org
Industrial as well as academic analytics systems are usually evaluated based on well-known
standard benchmarks, such as TPC-H or TPC-DS. These benchmarks test various …

SoK: Cryptanalysis of encrypted search with LEAKER-a framework for LEakage AttacK Evaluation on Real-world data

S Kamara, A Kati, T Moataz, T Schneider… - Cryptology ePrint …, 2021 - eprint.iacr.org
An encrypted search algorithm (ESA) allows a user to encrypt its data while preserving the
ability to search over it. As all practical solutions leak some information, cryptanalysis plays …

Data variety, come as you are in multi-model data warehouses

S Bimonte, E Gallinucci, P Marcel, S Rizzi - Information Systems, 2022 - Elsevier
Abstract Multi-model DBMSs (MMDBMSs) have been recently introduced to store and
seamlessly query heterogeneous data (structured, semi-structured, graph-based, etc.) in …

Logical design of multi-model data warehouses

S Bimonte, E Gallinucci, P Marcel, S Rizzi - Knowledge and Information …, 2023 - Springer
Multi-model DBMSs, which support different data models with a fully integrated backend,
have been shown to be beneficial to data warehouses and OLAP systems. Indeed, they can …

Data canopy: Accelerating exploratory statistical analysis

A Wasay, X Wei, N Dayan, S Idreos - Proceedings of the 2017 ACM …, 2017 - dl.acm.org
During exploratory statistical analysis, data scientists repeatedly compute statistics on data
sets to infer knowledge. Moreover, statistics form the building blocks of core machine …

FSST: fast random access string compression

P Boncz, T Neumann, V Leis - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
Strings are prevalent in real-world data sets. They often occupy a large fraction of the data
and are slow to process. In this work, we present Fast Static Symbol Table (FSST), a …

Beyond open vs. closed: Balancing individual privacy and public accountability in data sharing

M Young, L Rodriguez, E Keller, F Sun, B Sa… - Proceedings of the …, 2019 - dl.acm.org
Data too sensitive to be" open" for analysis and re-purposing typically remains" closed" as
proprietary information. This dichotomy undermines efforts to make algorithmic systems …