Programmable Gate Arrays. This makes it possible to use application-specific multi-core architectures
to run parts of an application in parallel. Automatic parallelization of embedded software is
desirable to make optimal use of these flexible multi-core architectures. In many programs,
loops account for the majority of the total execution time and benefit from parallelization. In this
contribution, we present a loop transformation tool that improves the parallelizability of …