Optimizing off-chip accesses in multicores

W Ding, X Tang, M Kandemir, Y Zhang… - Proceedings of the 36th …, 2015 - dl.acm.org
In a network-on-chip (NoC) based manycore architecture, an off-chip data access (main
memory access) needs to travel through the on-chip network, spending considerable …

ParaCite: auto-parallelization of a sequential program using the program dependence graph

S Kalyur, GS Nagaraja - 2016 International conference on …, 2016 - ieeexplore.ieee.org
Parallelization is an attractive and popular means of speeding up a given program.
Parallelization can be carried out at different levels. Instruction level parallelization exploits …

Mining for localization in android

LA Reina, G Robles - 2012 9th IEEE Working Conference on …, 2012 - ieeexplore.ieee.org
Localization, and in particular translation, is a key aspect of modern end-user software
applications. Open source systems have traditionally taken advantage of distributed and …

Automatically optimizing stencil computations on many-core NUMA architectures

PH Lin, Q Yi, D Quinlan, C Liao, Y Yan - … NY, USA, September 28-30, 2016 …, 2017 - Springer
This paper presents a system for automatically supporting the optimization of stencil kernels
on emerging Non-Uniform Memory Access (NUMA) many-core architectures, through a …

A survey of modeling techniques used in compiler design and implementation

S Kalyur, GS Nagaraja - 2016 International Conference on …, 2016 - ieeexplore.ieee.org
Compiler design is a complex and a mathematically precise activity. As such, mathematical
modeling plays a crucial part, in the compiler design process. Traditionally, compiler design …

Concerto: a program parallelization, orchestration and distribution infrastructure

S Kalyur, GS Nagaraja - 2017 2nd international conference on …, 2017 - ieeexplore.ieee.org
The important step in Program Parallelization, is identifying the pieces of the given program,
that can be run concurrently, on separate processing elements. The parallel pieces once …

Toward automatic data distribution for migrating computations

L Pan, J Xie, MK Lai, MB Dillencourt… - … Conference on Parallel …, 2007 - ieeexplore.ieee.org
Program parallelization requires mapping computation and data to processing elements.
Navigational Programming (NavP), based on the principle of migrating computations, offers …

Data locality and parallelism optimization using a constraint-based approach

O Ozturk - Journal of Parallel and Distributed Computing, 2011 - Elsevier
Embedded applications are becoming increasingly complex and processing ever-increasing
datasets. In the context of data-intensive embedded applications, there have been two …

[PDF][PDF] Techniques for Memory Mapping on Multi-Core Automotive Embedded Systems

R Amarnath - 2020 - Citeseer
The demand to increase performance while conserving power has led to the invention of
multicore systems. The software until now had the convenience of gaining better …

CALIPER: a coarse grain parallel performance estimator and predictor

S Kalyur, GS Nagaraja - Emerging Technologies in Computing: Third EAI …, 2020 - Springer
Abstract Empirical studies of Program Performance, are limited by the choice and the
resulting bias, from the input samples used in the experiment. Estimation and Prediction …