An analytical model for predicting the remaining battery capacity of lithium-ion batteries

P Rong, M Pedram - IEEE transactions on very large scale …, 2006 - ieeexplore.ieee.org
Predicting the residual energy of the battery source that powers a portable electronic device
is imperative in designing and applying an effective dynamic power management policy for …

Data and memory optimization techniques for embedded systems

PR Panda, F Catthoor, ND Dutt, K Danckaert… - ACM Transactions on …, 2001 - dl.acm.org
We present a survey of the state-of-the-art techniques used in performing data and memory-
related optimizations in embedded systems. The optimizations are targeted directly or …

Communication optimizations for fine-grained UPC applications

WY Chen, C Iancu, K Yelick - 14th International Conference on …, 2005 - ieeexplore.ieee.org
Global address space languages like UPC exhibit high performance and portability on a
broad class of shared and distributed memory parallel architectures. The most scalable …

Towards automatic translation of OpenMP to MPI

A Basumallik, R Eigenmann - Proceedings of the 19th annual …, 2005 - dl.acm.org
We present compiler techniques for translating OpenMP shared-memory parallel
applications into MPI message-passing programs for execution on distributed memory …

Gilgamesh: A multithreaded processor-in-memory architecture for petaflops computing

TL Sterling, HP Zima - SC'02: Proceedings of the 2002 ACM …, 2002 - ieeexplore.ieee.org
Processor-in-Memory (PIM) architectures avoid the von Neumann bottleneck in conventional
machines by integrating high-density DRAM and CMOS logic on the same chip. Parallel …

Automatic parallelization of recursive procedures

M Gupta, S Mukhopadhyay, N Sinha - International Journal of Parallel …, 2000 - Springer
Parallelizing compilers have traditionally focussed mainly on parallelizing loops. This paper
presents a new framework for automatically parallelizing recursive procedures that typically …

CC--MPI: a compiled communication capable MPI prototype for ethernet switched clusters

A Karwande, X Yuan, DK Lowenthal - ACM Sigplan Notices, 2003 - dl.acm.org
Compiled communication has recently been proposed to improve communication
performance for clusters of workstations. The idea of compiled communication is to apply …

Global communication analysis and optimization

S Chakrabarti, M Gupta, JD Choi - Proceedings of the ACM SIGPLAN …, 1996 - dl.acm.org
Reducing communication cost is crucial to achieving good performance on scalable parallel
machines. This paper presents a new compiler algorithm for global analysis and …

Communication optimizations for distributed-memory X10 programs

R Barik, J Zhao, D Grove, I Peshansky… - … Parallel & Distributed …, 2011 - ieeexplore.ieee.org
X10 is a new object-oriented PGAS (Partitioned Global Address Space) programming
language with support for distributed asynchronous dynamic parallelism that goes beyond …

Code transformations for data transfer and storage exploration preprocessing in multimedia processors

F Catthoor, K Danckaert, S Wuytack… - IEEE Design & Test of …, 2001 - ieeexplore.ieee.org
Platform-independent source code transformations can greatly help alleviate the data-
transfer and storage bottleneck. This article covers global data-flow, loop, and data-reuse …