Source-to-source compilation targeting OpenMP-based automatic parallelization of C applications

H Arabnejad, J Bispo, JMP Cardoso… - The Journal of …, 2020 - Springer
Directive-driven programming models, such as OpenMP, are one solution for exploring the
potential parallelism when targeting multicore architectures. Although these approaches …

Time and Energy Benefits of Using Automatic Optimization Compilers for NPDP Tasks

M Palkowski, M Gruzewski - Electronics, 2023 - mdpi.com
In this article, we analyze the program codes generated automatically using three advanced
optimizers: Pluto, Traco, and Dapt, which are specifically tailored for the NPDP benchmark …

Autopar-clava: An automatic parallelization source-to-source tool for c code applications

H Arabnejad, J Bispo, JG Barbosa… - Proceedings of the 9th …, 2018 - dl.acm.org
Automatic parallelization of sequential code has become increasingly relevant in multicore
programming. In particular, loop parallelization continues to be a promising optimization …

NPDP benchmark suite for the evaluation of the effectiveness of automatic optimizing compilers

M Palkowski, W Bielecki - Parallel Computing, 2023 - Elsevier
The paper presents a benchmark suite of ten non-serial polyadic dynamic programming
(NPDP) kernels, which are designed to test the efficiency of tiled code generated by …

TRACO: An automatic loop nest parallelizer for numerical applications

M Palkowski, T Klimek… - … Federated Conference on …, 2015 - ieeexplore.ieee.org
We present the source-to-source TRACO compiler allowing for increasing program locality
and parallelizing arbitrarily nested loop sequences in numerical applications. Algorithms for …

Efficiency of Various Tiling Strategies for the Zuker Algorithm Optimization

P Blaszynski, M Palkowski, W Bielecki, M Poliwoda - Mathematics, 2024 - mdpi.com
This paper focuses on optimizing the Zuker RNA folding algorithm, a bioinformatics task with
non-serial polyadic dynamic programming and non-uniform loop dependencies. The …

Using basis dependence distance vectors in the modified floyd–warshall algorithm

W Bielecki, K Kraska, T Klimek - Journal of Combinatorial Optimization, 2015 - Springer
In this paper, we present a modified Floyd–Warshall algorithm, where the most time-
consuming part—calculating transitive closure describing self-dependences for each loop …

Parallelization of an encryption algorithm based on a spatiotemporal chaotic system and a chaotic neural network

D Burak - Procedia Computer Science, 2015 - Elsevier
In this paper the results of parallelizing a block cipher based on a spatiotemporal chaotic
system and a chaotic neural network are presented. A data dependence analysis of loops …

An OpenMP based parallelization compiler for C applications

H Arabnejad, J Bispo, JG Barbosa… - 2018 IEEE Intl Conf …, 2018 - ieeexplore.ieee.org
Directive-drive programming models, such as OpenMP, are one solution for exploiting the
potential of multicore architectures, and enable developers to accelerate software …

Parallelization of the IDEA Algorithm

V Beletskyy, D Burak - International Conference on Computational …, 2004 - Springer
In this paper, we present results of parallelizing the International Data Encryption Algorithm
(IDEA). The data dependence analysis of loops was applied in order to parallelize this …