Clip: Load criticality based data prefetching for bandwidth-constrained many-core systems

B Panda - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Hardware prefetching is a latency-hiding technique that hides the costly off-chip DRAM
accesses. However, state-of-the-art prefetchers fail to deliver performance improvement in …

CRISP: critical slice prefetching

H Litz, G Ayers, P Ranganathan - Proceedings of the 27th ACM …, 2022 - dl.acm.org
The high access latency of DRAM continues to be a performance challenge for
contemporary microprocessor systems. Prefetching is a well-established technique to …

Register file prefetching

S Shukla, S Bandishte, J Gaur… - Proceedings of the 49th …, 2022 - dl.acm.org
The memory wall continues to limit the performance of modern out-of-order (OOO)
processors, despite the expensive provisioning of large multi-level caches and …

Instruction criticality based energy-efficient hardware data prefetching

NS Kalani, B Panda - IEEE Computer Architecture Letters, 2021 - ieeexplore.ieee.org
Hardware data prefetching is a latency hiding technique that mitigates the memory wall
problem by fetching data blocks into caches before the processor demands them. For high …

Leveraging targeted value prediction to unlock new hardware strength reduction potential

A Perais - MICRO-54: 54th Annual IEEE/ACM International …, 2021 - dl.acm.org
Value Prediction (VP) is a microarchitectural technique that speculatively breaks data
dependencies to increase the available Instruction Level Parallelism (ILP) in general …

Calipers: a criticality-aware framework for modeling processor performance

H Golestani, R Sen, V Young, G Gupta - Proceedings of the 36th ACM …, 2022 - dl.acm.org
Computer architecture design space is vast and complex. Tools are needed to explore new
ideas and gain insights quickly, at low effort and desired accuracy. Cycle Accurate …

Stride equality prediction for value speculation

L Yang, L Huang, R Yan, N Xiao, S Ma… - IEEE Computer …, 2022 - ieeexplore.ieee.org
Stride prediction is a nonignorable part of value prediction. Most hybrid predictors cannot
eliminate this pattern. Existing stride predictors can handle the regular stride pattern, but it is …

Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution

R Bera, A Ranganathan, J Rakshit, S Mahto… - arXiv preprint arXiv …, 2024 - arxiv.org
Load instructions often limit instruction-level parallelism (ILP) in modern processors due to
data and resource dependences they cause. Prior techniques like Load Value Prediction …

[PDF][PDF] 处理器值预测技术研究

黄立波, 杨凌, 杨乾明, 马胜, 王永文, 隋兵才, 沈立… - 电子学报, 2023 - ejournal.org.cn
当今的处理器性能与存储器带宽和延迟严重失衡的问题限制了计算系统的整体性能,
而存储器的性能对制程工艺不敏感, 在后摩尔时代下很难再通过集成电路制造工艺的迭代获得 …

A dependence graph pattern mining method for processor performance analysis

Y Zheng, C Han, T Zhang, F Zhang, J Wang - Performance Evaluation, 2024 - Elsevier
As the complexity of processor microarchitecture and applications increases, obtaining
performance optimization knowledge, such as critical dependent chains, becomes more …