Check out the big brain on BRAD: simplifying cloud data processing with learned automated data meshes

T Kraska, T Li, S Madden, M Markakis, A Ngom… - Proceedings of the …, 2023 - dl.acm.org
The last decade of database research has led to the prevalence of specialized systems for
different workloads. Consequently, organizations often rely on a combination of specialized …

[HTML][HTML] To prompt or not to prompt: Navigating the use of large language models for integrating and modeling heterogeneous data

A Remadi, K El Hage, Y Hobeika, F Bugiotti - Data & Knowledge …, 2024 - Elsevier
Manually integrating data of diverse formats and languages is vital to many artificial
intelligence applications. However, the task itself remains challenging and time-consuming …

The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto …

W Zhang, WS Lim, M Butrovich, A Pavlo - Proceedings of the VLDB …, 2024 - dl.acm.org
Existing machine learning (ML) approaches to automatically optimize database
management systems (DBMSs) only target a single configuration space at a time (eg, knobs …

Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems

WS Lim, L Ma, W Zhang, M Butrovich, S Arch… - Proceedings of the …, 2024 - dl.acm.org
Autonomous database management systems (DBMSs) aim to optimize themselves
automatically without human guidance. They rely on machine learning (ML) models that …

Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD

GX Yu, Z Wu, F Kossmann, T Li, M Markakis… - Proceedings of the …, 2024 - dl.acm.org
Modern organizations manage their data with a wide variety of specialized cloud database
engines (eg, Aurora, BigQuery, etc.). However, designing and managing such infrastructures …

[PDF][PDF] Learned Selection Strategy for Lightweight Integer Compression Algorithms.

L Woltmann, P Damme, C Hartmann, D Habich… - EDBT, 2023 - openproceedings.org
Data compression has recently experienced a revival in the domain of in-memory column
stores. In this field, a large corpus of lightweight integer compression algorithms plays a …

JoinGym: An Efficient Query Optimization Environment for Reinforcement Learning

K Wang, J Wang, Y Li, N Kallus, I Trummer… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we present\textsc {JoinGym}, an efficient and lightweight query optimization
environment for reinforcement learning (RL). Join order selection (JOS) is a classic NP-hard …

Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD--Extended Version

GX Yu, Z Wu, F Kossmann, T Li, M Markakis… - arXiv preprint arXiv …, 2024 - arxiv.org
Modern organizations manage their data with a wide variety of specialized cloud database
engines (eg, Aurora, BigQuery, etc.). However, designing and managing such infrastructures …

[PDF][PDF] JoinGym: An Efficient Join Order Selection Environment

J Wang, K Wang, Y Li, N Kallus… - Reinforcement …, 2024 - rlj.cs.umass.edu
Join order selection (JOS), the ordering of join operations to minimize query execution cost,
is a core NP-hard combinatorial optimization problem in database query optimization. We …

RDBlab: An Artificial Simulation System for RDBMSs

Y Yan, H Wang, J Huang, J Geng, Z Wang… - Asia-Pacific Web (APWeb …, 2023 - Springer
With the development of cloud database, the simulation system of RDBMSs become
increasing important for avoiding database failures. For example, a simulation system could …