Daphne: An open and extensible system infrastructure for integrated data analysis pipelines

P Damme, M Birkenbach, C Bitsakos… - … on Innovative Data …, 2022 - pure.itu.dk
Integrated data analysis (IDA) pipelines that combine data management (DM) and query
processing, high-performance computing (HPC), and machine learning (ML) training and …

Parallel logic programming: A sequel

A Dovier, A Formisano, G Gupta… - Theory and Practice of …, 2022 - cambridge.org
Multi-core and highly connected architectures have become ubiquitous, and this has
brought renewed interest in language-based approaches to the exploitation of parallelism …

Modularis: Modular relational analytics over heterogeneous distributed platforms

D Koutsoukos, I Müller, R Marroquín, A Klimovic… - arXiv preprint arXiv …, 2020 - arxiv.org
The enormous quantity of data produced every day together with advances in data analytics
has led to a proliferation of data management and analysis systems. Typically, these …

Topology-aware Parallel Joins

X Hu, P Koutris - Proceedings of the ACM on Management of Data, 2024 - dl.acm.org
We study the design and analysis of parallel join algorithms in a topology-aware
computational model. In this model, the network is modeled as a directed graph, where each …

Algorithms for a topology-aware massively parallel computation model

X Hu, P Koutris, S Blanas - Proceedings of the 40th ACM SIGMOD …, 2021 - dl.acm.org
Most of the prior work in massively parallel data processing assumes homogeneity, i.e., every
computing unit has the same computational capability and can communicate with every …

Templating Shuffles

Q Zhang, J Wu, A Chen, V Liu, BT Loo - arXiv preprint arXiv:2207.10746, 2022 - arxiv.org
Cloud data centers are evolving fast. At the same time, today's large-scale data analytics
applications require non-trivial performance tuning that is often specific to the applications …

The Hardness of Optimization Problems on the Weighted Massively Parallel Computation Model

H Ma, J Li - International Computing and Combinatorics …, 2023 - Springer
The topology-aware Massively Parallel Computation (MPC) model is proposed and
studied recently, which enhances the classical MPC model by the awareness of network …

Adapting database components to heterogeneous environments

D Koutsoukos - 2024 - research-collection.ethz.ch
Data management has seen rapid evolution during the last years, influenced by factors such
as data explosion, the prevalence of machine and deep learning, the slowdown of Moore's …

A New Model for Massively Parallel Computation Considering both Communication and IO Cost

H Ma, X Gao, J Li, T Gao - arXiv preprint arXiv:2203.12811, 2022 - arxiv.org
In the research area of parallel computation, the communication cost has been extensively
studied, while the IO cost has been neglected. For big data computation, the assumption that …

Construction of College Students' Course Management Information System Based on Data Center and Parallel Model

J Li - … Conference on Sustainable Computing and Data …, 2022 - ieeexplore.ieee.org
Construction of the college students' course management information system based on data
center and parallel model is discussed in this research. First, this research work continues to …