Transactional memory: An overview

T Harris, A Cristal, OS Unsal, E Ayguade… - IEEE micro, 2007 - ieeexplore.ieee.org
Writing applications that benefit from the massive computational power of future multicore
chip multiprocessors will not be an easy task for mainstream programmers accustomed to …

A case for MLP-aware cache replacement

MK Qureshi, DN Lynch, O Mutlu, YN Patt - ACM SIGARCH Computer …, 2006 - dl.acm.org
Performance loss due to long-latency memory accesses can be reduced by servicing
multiple memory accesses concurrently. The notion of generating and servicing long-latency …

Data-centric computing frontiers: A survey on processing-in-memory

P Siegl, R Buchty, M Berekovic - Proceedings of the Second …, 2016 - dl.acm.org
A major shift from compute-centric to data-centric computing systems can be perceived, as
novel big data workloads like cognitive computing and machine learning strongly enforce …

Hybrid dataflow/von-Neumann architectures

F Yazdanpanah, C Alvarez-Martinez… - … on Parallel and …, 2013 - ieeexplore.ieee.org
General purpose hybrid dataflow/von-Neumann architectures are gaining attraction as
effective parallel platforms. Although different implementations differ in the way they merge …

QuantBayes: Weight optimization for memristive neural networks via quantization-aware Bayesian inference

Y Zhou, X Hu, L Wang, G Zhou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The memristor-based neuromorphic computing system (NCS) with emerging storage and
computing integration architecture has drawn extensive attention. Because of the unique …

A Hybrid Weight Quantization Strategy for Memristive Neural Networks

S Shen, S Duan, L Wang - Neurocomputing, 2023 - Elsevier
Due to the ability to store data and process information, the memristor-based neuromorphic
system has attracted extensive attention. Its efficient parallel computing approach allows it to …

Simultaneous speculative threading: A novel pipeline architecture implemented in sun's rock processor

S Chaudhry, R Cypher, M Ekman, M Karlsson… - ACM SIGARCH …, 2009 - dl.acm.org
This paper presents Simultaneous Speculative Threading (SST), which is a technique for
creating high-performance area-and power-efficient cores for chip multiprocessors. SST …

MLP-aware dynamic instruction window resizing for adaptively exploiting both ILP and MLP

Y Kora, K Yamaguchi, H Ando - Proceedings of the 46th Annual IEEE …, 2013 - dl.acm.org
It is difficult to improve the single-thread performance of a processor in memory-intensive
programs because processors have hit the memory wall, ie, the large speed discrepancy …

Late-binding: Enabling unordered load-store queues

S Sethumadhavan, F Roesner, JS Emer… - ACM SIGARCH …, 2007 - dl.acm.org
Conventional load/store queues (LSQs) are an impediment to both power-efficient execution
in superscalar processors and scaling tolarge-window designs. In this paper, we propose …

Implementing kilo-instruction multiprocessors

E Vallejo, M Galluzzi, A Cristal, F Vallejo… - ICPS'05 …, 2005 - ieeexplore.ieee.org
Multiprocessors are coming into wide-spread use in many application areas, yet there are a
number of challenges to achieving a good tradeoff between complexity and performance …