Learnedsqlgen: Constraint-aware sql generation using reinforcement learning

L Zhang, C Chai, X Zhou, G Li - … of the 2022 International Conference on …, 2022 - dl.acm.org
Many database optimization problems, eg, slow SQL diagnosis, database testing, optimizer
tuning, require a large volume of SQL queries. Due to privacy issues, it is hard to obtain real …

Comprehensive and efficient workload compression

S Deep, A Gruenheid, P Koutris, J Naughton… - arXiv preprint arXiv …, 2020 - arxiv.org
This work studies the problem of constructing a representative workload from a given input
analytical query workload where the former serves as an approximation with guarantees of …

Optimal resource allocation for serverless queries

A Pimpley, S Li, A Srivastava, V Rohra, Y Zhu… - arXiv preprint arXiv …, 2021 - arxiv.org
Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-
data services. At the same time, it is incredibly hard for users to allocate resources per query …

[PDF][PDF] Towards Optimal Resource Allocation for Big Data Analytics.

A Pimpley, S Li, R Sen, S Srinivasan, A Jindal - EDBT, 2022 - openproceedings.org
Optimizing resource allocation for analytical workloads is vital for reducing operational costs
in modern cloud-oriented query processing services. At the same time, it is incredibly hard …

Sparkcruise: Workload optimization in managed spark clusters at microsoft

A Roy, A Jindal, P Gomatam, X Ouyang… - Proceedings of the …, 2021 - dl.acm.org
Today cloud companies offer fully managed Spark services. This has made it easy to
onboard new customers but has also increased the volume of users and their workload …

Doppler: automated SKU recommendation in migrating SQL workloads to the cloud

J Cahoon, W Wang, Y Zhu, K Lin, S Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
Selecting the optimal cloud target to migrate SQL estates from on-premises to the cloud
remains a challenge. Current solutions are not only time-consuming and error-prone …

DBMS annihilator: a high-performance database workload generator in action

A Lerner, M Jasny, T Jepsen, C Binnig… - Proceedings of the …, 2022 - dl.acm.org
Modern DBMS engines can achieve unprecedented transaction processing speeds thanks
to the invention of clever data structures, concurrency schemes, and improvements in CPU …

[PDF][PDF] LST-Bench: Benchmarking Log-Structured Tables in the Cloud

J Camacho-Rodríguez, A Agrawal… - Proceedings of the …, 2024 - dl.acm.org
Data processing engines increasingly leverage distributed file systems for scalable, cost-
effective storage. While the Apache Parquet columnar format has become a popular choice …

Towards a benchmark for learned systems

L Bindschaedler, A Kipf, T Kraska… - 2021 IEEE 37th …, 2021 - ieeexplore.ieee.org
This paper aims to initiate a discussion around benchmarking data management systems
with machine-learned components. Traditional benchmarks such as TPC or YCSB are …

Using eBPF for Database Workload Tracing: An Explorative Study

J Domaschka, S Volpert, K Maier, G Eisenhart… - Companion of the 2023 …, 2023 - dl.acm.org
Database management systems~(DBMS) are crucial architectural components of any
modern distributed software system. Yet, ensuring a smooth, high-performant operation of a …