Deep configuration performance learning: A systematic survey and taxonomy

J Gong, T Chen - ACM Transactions on Software Engineering and …, 2024 - dl.acm.org
Performance is arguably the most crucial attribute that reflects the quality of a configurable
software system. However, given the increasing scale and complexity of modern software …

MLOS in Action: Bridging the Gap Between Experimentation and Auto-Tuning in the Cloud

B Kroth, S Matusevych, R Alotaibi, Y Zhu… - Proceedings of the …, 2024 - dl.acm.org
This paper presents MLOS (ML Optimized Systems), a flexible framework that bridges the
gap between benchmarking, experimentation, and optimization of software systems. It …

ReAcTable: Enhancing ReAct for Table Question Answering

Y Zhang, J Henkel, A Floratou, J Cahoon… - arXiv preprint arXiv …, 2023 - arxiv.org
Table Question Answering (TQA) presents a substantial challenge at the intersection of
natural language processing and data analytics. This task involves answering natural …

A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency Validation

S Simon, A Mailach, J Dorn, N Siegmund - arXiv preprint arXiv …, 2024 - arxiv.org
Retrieval-augmented generation (RAG) is an umbrella of different components, design
decisions, and domain-specific adaptations to enhance the capabilities of large language …

Large Language Models: Principles and Practice

I Trummer - 2024 IEEE 40th International Conference on Data …, 2024 - ieeexplore.ieee.org
The last few years have been marked by several breakthroughs in the domain of generative
AI. Large language models such as GPT-4 are able to solve a plethora of tasks, ranging from …

DBG-PT: A Large Language Model Assisted Query Performance Regression Debugger

V Giannakouris, I Trummer - Proceedings of the VLDB Endowment, 2024 - dl.acm.org
In this paper we explore the ability of Large Language Models (LLMs) in analyzing and
comparing query plans, and resolving query performance regressions. We present DBG-PT …

Demonstrating λ-tune: Exploiting large language models for workload-adaptive database system tuning

V Giannakouris, I Trummer - … of the 2024 International Conference on …, 2024 - dl.acm.org
We demonstrate λ-Tune, a tool that leverages Large Language Models (LLMs) for
automated, workload-adaptive database system tuning. λ-Tune harnesses the ability of …

Revisiting Data Analysis with Pre-trained Foundation Models

C Liang, D Yang, Z Liang, Z Liang, T Zhang… - arXiv preprint arXiv …, 2025 - arxiv.org
Data analysis focuses on harnessing advanced statistics, programming, and machine
learning techniques to extract valuable insights from vast datasets. An increasing volume …

DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud

Q Wang, T Lan, Y Tang, B Sang, Z Huang… - Proceedings of the …, 2024 - dl.acm.org
Deep learning recommendation models (DLRM) rely on large embedding tables to manage
categorical sparse features. Expanding such embedding tables can significantly enhance …

Towards SLO-Optimized LLM Serving via Automatic Inference Engine Tuning

K Cheng, Z Wang, W Hu, T Yang, J Li… - arXiv preprint arXiv …, 2024 - arxiv.org
A service-level objective (SLO) is a target performance metric of service that cloud vendors
aim to ensure. Delivering optimized SLOs can enhance user satisfaction and improve the …