Migration-aware loop retiming for STT-RAM-based hybrid cache in embedded systems

K Qiu, M Zhao, Q Li, C Fu… - IEEE Transactions on …, 2014 - ieeexplore.ieee.org
… a loop retiming framework during compilation to reduce the migration overhead by changing
the interleaved memory access pattern. With the proposed loop retiming … of hybrid cache is …

Balanced loop retiming to effectively architect STT-RAM-based hybrid cache for VLIW processors

K Qiu, W Zhang, X Wu, X Zhu, J Wang, Y Xu… - Proceedings of the 31st …, 2016 - dl.acm.org
… NVM-based hybrid cache while the migration-aware loop retiming approaches focusing on
… balanced loop retiming approach to effectively architect the STT-RAM-based hybrid cache for …

Brloop: Constructing balanced retimed loop to architect stt-ram-based hybrid cache for vliw processors

K Qiu, Y Zhu, Y Xu, Q Huo, CJ Xue - Microelectronics Journal, 2019 - Elsevier
… -based hybrid cache while the migration-aware loop retimingloop retiming to achieve a
comprehensive optimization considering ILP of ALUs and migration overhead of hybrid cache

Migration-aware loop retiming for STT-RAM based hybrid cache for embedded systems

K Qiu, M Zhao, C Fu, L Shi… - 2013 IEEE 24th …, 2013 - ieeexplore.ieee.org
loop retiming approach to reduce migrations in loops and thereby improve the energy efficiency
and performance of STTRAM based hybrid cache … very large in loops with intensive data …

Iterational retiming with partitioning: Loop scheduling with complete memory latency hiding

CJ Xue, J Hu, Z Shao, E Sha - ACM Transactions on Embedded …, 2010 - dl.acm.org
… proposed—from intermediate fast memories (caches) to various prefetching and memory …
propose a new loop scheduling with memory management technique, Iterational Retiming with …

Loop optimization in presence of stt-mram caches: A study of performance-energy tradeoffs

PY Péneau, R Bouziane, A Gamatié… - … Power and Timing …, 2016 - ieeexplore.ieee.org
… data migration in hybrid SRAM/STT-MRAM caches. They also focus on hybrid main memory
… While all above studies mainly focus on the hybrid nature of specific cache levels, the scope …

Hycsim: A rapid design space exploration tool for emerging hybrid last-level caches

C Escuin, AA Khan, P Ibañez, T Monreal… - System Engineering for …, 2022 - dl.acm.org
… state-of-the-art cache simulators supporting different types (functional, timing), modes (trace…
for hybrid cache designs, HyCSim has built-in support for it. Every block has a loop block bit …

Performance optimization for parallel systems with shared DWM via retiming, loop scheduling, and data placement

S Gao, S Gu, R Xu, EHM Sha, Q Zhuge - Journal of Systems Architecture, 2021 - Elsevier
Retiming Repetition (TRR) algorithm to combine the retiming … the execution time when
compared to results without retiming. … , we reduce loop execution time by retiming some nodes to …

Hybrid compiler/hardware prefetching for multiprocessors using low-overhead cache miss traps

J Skeppstedt, M Dubois - Proceedings of the 1997 International …, 1997 - ieeexplore.ieee.org
… in the second-level cache generate a cache miss trap. Each … engine in software by a loop in
the trap handler which issues … system architectural simulator with a detailed timing model [3]. …

Hybrid approach on cache aware real-time scheduling for multi-core systems

T Huybrechts, Y De Bock, H Li, P Hellinckx - Advances on P2P, Parallel …, 2017 - Springer
… For example, a while-loop will … hybrid methodology. For these tests, we have selected two
testbenches from the TACLeBench to perform timing measurements on their generated hybrid