Specializing parallel data structures for Datalog

H Jordan, P Subotić, D Zhao… - … : Practice and Experience, 2022 - Wiley Online Library
Concurrency and Computation: Practice and Experience, 2022Wiley Online Library
We see a resurgence of Datalog in a variety of applications, including program analysis,
networking, data integration, cloud computing, and security. The large‐scale and complexity
of these applications need the efficient management of data in relations. Hence, Datalog
implementations require new data structures for managing relations that (1) are parallel,(2)
are highly specialized for Datalog evaluation, and (3) can accommodate different workloads
depending on the applications concerning memory consumption and computational …
Summary
We see a resurgence of Datalog in a variety of applications, including program analysis, networking, data integration, cloud computing, and security. The large‐scale and complexity of these applications need the efficient management of data in relations. Hence, Datalog implementations require new data structures for managing relations that (1) are parallel, (2) are highly specialized for Datalog evaluation, and (3) can accommodate different workloads depending on the applications concerning memory consumption and computational efficiency. In this article, we present a data structure framework for relations that is specialized for shared‐memory parallel Datalog implementations such as the soufflé Datalog compiler. The data structure framework permits a portfolio of different data structures depending on the workload. We also introduce two concrete parallel data structures for relations, designed for various workloads. Our benchmarks demonstrate a speed‐up of up to 6× by using a portfolio of data structures compared with using a B‐tree alone, showing the advantage of our data structure framework.
Wiley Online Library
以上显示的是最相近的搜索结果。 查看全部搜索结果