Effective Context-Sensitive Memory Dependence Prediction

SS Kim, A Ros - 2024 IEEE International Symposium on High …, 2024 - ieeexplore.ieee.org
Memory dependence prediction is a fundamental technique to increase instruction-and
memory-level parallelism in out-of-order processors, which are crucial for high performance …

[HTML][HTML] Speculative inter-thread store-to-load forwarding in SMT architectures

J Feliu, A Ros, ME Acacio, S Kaxiras - Journal of Parallel and Distributed …, 2023 - Elsevier
Applications running on out-of-order cores have benefited for decades of store-to-load
forwarding which accelerates communication of store values to loads of the same thread …

CASINO core microarchitecture: Generating out-of-order schedules using cascaded in-order scheduling windows

I Jeong, S Park, C Lee, WW Ro - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
The performance gap between in-order (InO) and out-of-order (OoO) cores comes from the
ability to dynamically create highly optimized instruction issue schedules. In this work, we …

Criticality-based optimizations for efficient load processing

S Subramaniam, A Bracy, H Wang… - 2009 IEEE 15th …, 2009 - ieeexplore.ieee.org
Some instructions have more impact on processor performance than others. Identification of
these critical instructions can be used to modify and improve instruction processing …

The superfluous load queue

A Ros, S Kaxiras - 2018 51st Annual IEEE/ACM International …, 2018 - ieeexplore.ieee.org
In an out-of-order core, the load queue (LQ), the store queue (SQ), and the store buffer (SB)
are responsible for ensuring: i) correct forwarding of stores to loads and ii) correct ordering …

Microarchitecture soft error vulnerability characterization and mitigation under 3D integration technology

W Zhang, T Li - 2008 41st IEEE/ACM International Symposium …, 2008 - ieeexplore.ieee.org
As semiconductor processing techniques continue to scale down, transient faults, also
known as soft errors, are increasingly becoming a reliability threat to high-performance …

Reconstructing Out-of-Order Issue Queue

I Jeong, J Lee, MK Yoon, WW Ro - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Out-of-order cores provide high performance at the cost of energy efficiency. Dynamic
scheduling is one of the major contributors to this: generating highly optimized issue …

Dynamic data dependence tracking and its application to branch prediction

L Chen, S Dropsho, DH Albonesi - The Ninth International …, 2003 - ieeexplore.ieee.org
To continue to improve processor performance, microarchitects seek to increase the
effective instruction level parallelism (ILP) that can be exploited in applications. A …

Itslf: Inter-thread store-to-load forwardingin simultaneous multithreading

J Feliu, A Ros, ME Acacio, S Kaxiras - MICRO-54: 54th Annual IEEE …, 2021 - dl.acm.org
In this paper, we argue that, for a class of fine-grain, synchronization-intensive, parallel
workloads, it is advantageous to consolidate synchronization and communication as much …

Speculative enforcement of store atomicity

A Ros, S Kaxiras - 2020 53rd Annual IEEE/ACM International …, 2020 - ieeexplore.ieee.org
Various memory consistency model implementations (eg, x86, SPARC) willfully allow a core
to see its own stores while they are in limbo, ie, executed (and perhaps retired) but not yet …