Check out the big brain on BRAD: simplifying cloud data processing with learned automated data meshes

T Kraska, T Li, S Madden, M Markakis, A Ngom… - Proceedings of the …, 2023 - dl.acm.org
The last decade of database research has led to the prevalence of specialized systems for
different workloads. Consequently, organizations often rely on a combination of specialized …

Make your database system dream of electric sheep: towards self-driving operation

A Pavlo, M Butrovich, L Ma, P Menon, WS Lim… - Proceedings of the …, 2021 - dl.acm.org
Database management systems (DBMSs) are notoriously difficult to deploy and administer.
Self-driving DBMSs seek to remove these impediments by managing themselves …

Sagedb: An instance-optimized data analytics system

J Ding, R Marcus, A Kipf, V Nathan… - Proceedings of the …, 2022 - par.nsf.gov
Modern data systems are typically both complex and general-purpose. They are complex
because of the numerous internal knobs and parameters that users need to manually tune in …

What Goes Around Comes Around... And Around...

M Stonebraker, A Pavlo - ACM Sigmod Record, 2024 - dl.acm.org
Two decades ago, one of us co-authored a paper commenting on the previous 40 years of
data modelling research and development [188]. That paper demonstrated that the relational …

Kea: Tuning an exabyte-scale data infrastructure

Y Zhu, S Krishnan, K Karanasos, I Tarte… - Proceedings of the …, 2021 - dl.acm.org
Microsoft's internal big-data infrastructure is one of the largest in the world---with over 300k
machines running billions of tasks from over 0.6 M daily jobs. Operating this infrastructure is …

Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD

GX Yu, Z Wu, F Kossmann, T Li, M Markakis… - Proceedings of the …, 2024 - dl.acm.org
Modern organizations manage their data with a wide variety of specialized cloud database
engines (eg, Aurora, BigQuery, etc.). However, designing and managing such infrastructures …

Dynamic index construction with deep reinforcement learning

S Wu, Y Li, H Zhu, J Zhao, G Chen - Data Science and Engineering, 2022 - Springer
Thanks to the rapid advances in artificial intelligence, a brand new venue for database
performance optimization is through deep neural networks and the reinforcement learning …

[PDF][PDF] FLIRT: A Fast Learned Index for Rolling Time frames.

G Yang, L Liang, A Hadian, T Heinis - EDBT, 2023 - openproceedings.org
Efficiently managing and querying sliding windows is a key component in stream processing
systems. Conventional index structures such as the B+ Tree are not efficient for handling a …

A unified and efficient coordinating framework for autonomous DBMS tuning

X Zhang, Z Chang, H Wu, Y Li, J Chen, J Tan… - Proceedings of the …, 2023 - dl.acm.org
Recently using machine learning (ML) based techniques to optimize the performance of
modern database management systems (DBMSs) has attracted intensive interest from both …

MB2: decomposed behavior modeling for self-driving database management systems

L Ma, W Zhang, J Jiao, W Wang, M Butrovich… - Proceedings of the …, 2021 - dl.acm.org
Database management systems (DBMSs) are notoriously difficult to deploy and administer.
The goal of a self-driving DBMS is to remove these impediments by managing itself …