Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model

U Bondhugula, M Baskaran, S Krishnamoorthy… - … CC 2008, Held as Part of …, 2008 - Springer
The polyhedral model provides powerful abstractions to optimize loop nests with regular
accesses. Affine transformations in this model capture a complex sequence of execution …

Maximizing parallelism and minimizing synchronization with affine transforms

AW Lim, MS Lam - Proceedings of the 24th ACM SIGPLAN-SIGACT …, 1997 - dl.acm.org
This paper presents the first algorithm to find the optimal affine transform that maximizes the
degree of parallelism while minimizing the degree of synchronization in a program with …

Using time skewing to eliminate idle time due to memory bandwidth and network limitations

D Wonnacott - Proceedings 14th International Parallel and …, 2000 - ieeexplore.ieee.org
Time skewing is a compile-time optimization that can provide arbitrarily high cache hit rates
for a class of iterative calculations, given a sufficient number of time steps and sufficient …

Code generation for multiple mappings

W Kelly, W Pugh, E Rosser - … ' 95. The Fifth Symposium on the …, 1995 - ieeexplore.ieee.org
There has been a great amount of recent work toward unifying iteration reordering
transformations. Many of these approaches represent transformations as affine mappings …

Counting solutions to presburger formulas: How and why

W Pugh - ACM SIGPLAN Notices, 1994 - dl.acm.org
We describe methods that are able to count the number of integer solutions to selected free
variables of a Presburger formula, or sum a polynomial over all integer solutions of selected …

Achieving scalable locality with time skewing

D Wonnacott - International Journal of Parallel Programming, 2002 - Springer
Microprocessor speed has been growing exponentially faster than memory system speed in
the recent past. This paper explores the long term implications of this trend. We define …

Time skewing: A value-based approach to optimizing for memory locality

J McCalpin, D Wonnacott - 1998 - scholarship.libraries.rutgers.edu
As the gap between proessor and main memory speed ontinues to grow, higher ahe hit
rates are required for eÆient proessor use. Re-ent work on ompile-time transformations to …

A unifying framework for iteration reordering transformations

W Kelly, W Pugh - … on Algorithms and Architectures for Parallel …, 1995 - ieeexplore.ieee.org
We present a framework for unifying iteration reordering transformations such as loop
interchange, loop distribution, skewing, tiling, index set splitting and statement reordering …

Time skewing for parallel computers

D Wonnacott - International Workshop on Languages and Compilers …, 1999 - Springer
Time skewing [Won99a] is a compile-time optimization that can achieve scalable locality for
a class of iterative stencil calculations, given a sufficient number of time steps and sufficient …

[PS][PS] The Omega calculator and library, version 1.1. 0

W Kelly, V Maslov, W Pugh, E Rosser… - College Park …, 1996 - users.cs.utah.edu
This document gives an overview of the Omega library and describes the Omega Calculator,
a text-based interface to the Omega library. A separate document describes the C++ …