Clip: Load criticality based data prefetching for bandwidth-constrained many-core systems

B Panda - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Hardware prefetching is a latency-hiding technique that hides the costly off-chip DRAM
accesses. However, state-of-the-art prefetchers fail to deliver performance improvement in …

RL-CoPref: a reinforcement learning-based coordinated prefetching controller for multiple prefetchers

H Yang, J Fang, X Su, Z Cai, Y Wang - The Journal of Supercomputing, 2024 - Springer
Modern processors employ data prefetchers to alleviate the impact of long memory access
latency. However, current prefetchers are designed for specific memory access patterns …

Rowhammer Cache: A Last-Level Cache for Low-Overhead Rowhammer Tracking

A Singh, B Panda - … on Hardware Oriented Security and Trust …, 2024 - ieeexplore.ieee.org
The rowhammer attack on modern DRAM systems is here to stay as the number of row
activations required to induce a DRAM bit flip (rowhammer threshold) is following a trend of …

CDPM: Context-directed pattern matching prefetching to improve coarse-grained reconfigurable array performance

L Liu, C Yang, S Yin, S Wei - IEEE Transactions on Computer …, 2017 - ieeexplore.ieee.org
Coarse-grained reconfigurable arrays (CGRAs) can be dynamically programmed by
configuration contexts to concurrently run multiple operations on a processing elements …

Thread criticality assisted replication and migration for chip multiprocessor caches

J Li, M Li, CJ Xue, Y Ouyang… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Non-Uniform Cache Architecture (NUCA) is a viable solution to mitigate the problem of large
on-chip wire delay due to the rapid increase in the cache capacity of chip multiprocessors …

Data cache prefetching via context directed pattern matching for coarse-grained reconfigurable arrays

C Yang, L Liu, S Yin, S Wei - Proceedings of the 53rd Annual Design …, 2016 - dl.acm.org
This paper proposes a context directed pattern matching (CDPM) mechanism, which
employs the context of the coarse-grained reconfigurable arrays (CGRAs) as a guide to …

Sb-fetch: Synchronization aware hardware prefetching for chip multiprocessors

LM AlBarakat, PV Gratz, DA Jiménez - Proceedings of the 34th ACM …, 2020 - dl.acm.org
Shared-memory, multi-threaded applications often require programmers to insert thread
synchronization primitives (ie locks, barriers, and condition variables) in critical sections to …

A Fairness-Aware Prefetching Mechanism based on Reinforcement Learning for Multi-Core Systems

H Yang, J Fang - 2023 IEEE International Conference on High …, 2023 - ieeexplore.ieee.org
In modern multi-core computer architectures, composite prefetching techniques exhibit
promising potential in efficiently handling diverse memory access patterns. However, the …

Speculative Techniques for Memory Hierarchy Management

LM Albarakat - 2021 - search.proquest.com
Abstract The “Memory Wall”, is the gap in performance between the processor and the main
memory. Over the last 30 years computer architects have added multiple levels of cache to …