Rheem: enabling cross-platform data processing: may the big data be with you!

D Agrawal, S Chawla, B Contreras-Rojas… - Proceedings of the …, 2018 - dl.acm.org
Solving business problems increasingly requires going beyond the limits of a single data
processing platform (platform for short), such as Hadoop or a DBMS. As a result …

On-demand state separation for cloud data warehousing

C Winter, J Giceva, T Neumann, A Kemper - Proceedings of the VLDB …, 2022 - dl.acm.org
Moving data analysis and processing to the cloud is no longer reserved for a few companies
with petabytes of data. Instead, the flexibility of on-demand resources is attracting an …

ML-based cross-platform query optimization

Z Kaoudi, JA Quiané-Ruiz… - 2020 IEEE 36th …, 2020 - ieeexplore.ieee.org
Cost-based optimization is widely known to suffer from a major weakness: administrators
spend a significant amount of time to tune the associated cost models. This problem only …

Unified data analytics: state-of-the-art and open problems

Z Kaoudi, JA Quiané-Ruiz - Proceedings of the VLDB Endowment, 2022 - dl.acm.org
There is an urgent need for unifying data analytics as more and more application tasks
become more complex: Nowadays, it is normal to see tasks performing data preparation …

Justice: A deadline-aware, fair-share resource allocator for implementing multi-analytics

S Dimopoulos, C Krintz, R Wolski - 2017 IEEE international …, 2017 - ieeexplore.ieee.org
In this paper, we present Justice, a fair-share deadline-aware resource allocator for big data
cluster managers. In resource constrained environments, where resource contention …

[图书][B] Heterogeneous computing architectures: Challenges and vision

O Terzo, K Djemame, A Scionti, C Pezuela - 2019 - books.google.com
Heterogeneous Computing Architectures: Challenges and Vision provides an updated
vision of the state-of-the-art of heterogeneous computing systems, covering all the aspects …

RHEEMix in the data jungle: a cost-based optimizer for cross-platform systems

S Kruse, Z Kaoudi, B Contreras-Rojas, S Chawla… - The VLDB Journal, 2020 - Springer
Data analytics are moving beyond the limits of a single platform. In this paper, we present
the cost-based optimizer of Rheem, an open-source cross-platform system that copes with …

Cross-platform data processing: use cases and challenges

Z Kaoudi, JA Quiané-Ruiz - 2018 IEEE 34th International …, 2018 - ieeexplore.ieee.org
There is a zoo of data processing platforms which help users and organizations to extract
value out of their data. Although each of these platforms excels in specific aspects, users …

Pythia: Admission control for multi-framework, deadline-driven, big data workloads

S Dimopoulos, C Krintz, R Wolski - 2017 IEEE 10th …, 2017 - ieeexplore.ieee.org
In this paper, we present PYTHIA, deadline-aware admission control for systems that
execute jobs from multiple big data (batch) frameworks using shared resources. PYTHIA …

Optimizing cross-platform data movement

S Kruse, Z Kaoudi, JA Quiané-Ruiz… - 2019 IEEE 35th …, 2019 - ieeexplore.ieee.org
Data analytics are moving beyond the limits of a single data processing platform. A cross-
platform query optimizer is necessary to enable applications to run their tasks over multiple …