Cost models for big data query processing: Learning, retrofitting, and our findings

T Siddiqui, A Jindal, S Qiao, H Patel, W Le - Proceedings of the 2020 …, 2020 - dl.acm.org
Query processing over big data is ubiquitous in modern clouds, where the system takes care
of picking both the physical query execution plans and the resources needed to run those …

{OPTIMUSCLOUD}: Heterogeneous configuration optimization for distributed databases in the cloud

A Mahgoub, AM Medoff, R Kumar, S Mitra… - 2020 USENIX Annual …, 2020 - usenix.org
Achieving cost and performance efficiency for cloud-hosted databases requires exploring a
large configuration space, including the parameters exposed by the database along with the …

Finding the right cloud configuration for analytics clusters

M Bilal, M Canini, R Rodrigues - … of the 11th ACM Symposium on Cloud …, 2020 - dl.acm.org
Finding good cloud configurations for deploying a single distributed system is already a
challenging task, and it becomes substantially harder when a data analytics cluster is …

Unearthing inter-job dependencies for better cluster scheduling

A Chung, S Krishnan, K Karanasos, C Curino… - … USENIX Symposium on …, 2020 - usenix.org
Inter-job dependencies pervade shared data analytics infrastructures (so-called``data
lakes''), as jobs read output files written by previous jobs, yet are often invisible to current …

SCYLLA: QoE-aware continuous mobile vision with FPGA-based dynamic deep neural network reconfiguration

S Jiang, Z Ma, X Zeng, C Xu, M Zhang… - … -IEEE Conference on …, 2020 - ieeexplore.ieee.org
Continuous mobile vision is becoming increasingly important as it finds compelling
applications which substantially improve our everyday life. However, meeting the …

Autotoken: Predicting peak parallelism for big data analytics at microsoft

R Sen, A Jindal, H Patel, S Qiao - Proceedings of the VLDB Endowment, 2020 - dl.acm.org
Right-sizing resource allocation for big-data queries, particularly in serverless environments,
is critical for improving infrastructure operational efficiency, capacity availability, query …

A black-box fork-join latency prediction model for data-intensive applications

M Nguyen, S Alesawi, N Li, H Che… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
The workflows of the predominant datacenter services are underlaid by various Fork-Join
structures. Due to the lack of good understanding of the performance of Fork-Join structures …

Budgeted coupon advertisement problem: Algorithm and robust analysis

J Guo, T Chen, W Wu - IEEE Transactions on Network Science …, 2020 - ieeexplore.ieee.org
Coupon advertisement is everywhere in people's daily lives, and it is a common marketing
strategy adopted by merchants. A problem, Budget Profit Maximization with Coupon …

A general framework for handling commitment in online throughput maximization

L Chen, F Eberle, N Megow, K Schewior… - Mathematical …, 2020 - Springer
We study a fundamental online job admission problem where jobs with deadlines arrive
online over time at their release dates, and the task is to determine a preemptive single …

Towards plan-aware resource allocation in serverless query processing

M Bag, A Jindal, H Patel - 12th USENIX Workshop on Hot Topics in …, 2020 - usenix.org
Resource allocation for serverless query processing is a challenge. Unfortunately, prior
approaches have treated queries as black boxes, thereby missing significant resource …