[图书][B] Computer architecture: a quantitative approach

JL Hennessy, DA Patterson - 2011 - books.google.com
Computer Architecture: A Quantitative Approach, Fifth Edition, explores the ways that
software and technology in the cloud are accessed by digital media, such as cell phones …

Runahead execution: An alternative to very large instruction windows for out-of-order processors

O Mutlu, J Stark, C Wilkerson… - The Ninth International …, 2003 - ieeexplore.ieee.org
Today's high performance processors tolerate long latency operations by means of out-of-
order execution. However, as latencies increase, the size of the instruction window must …

Memory dependence prediction using store sets

GZ Chrysos, JS Emer - ACM SIGARCH Computer Architecture News, 1998 - dl.acm.org
For maximum performance, an out-of-order processor must issue load instructions as early
as possible, while avoiding memory-order violations with prior store instructions that write to …

Transient-Execution Attacks: A Computer Architect Perspective

L Fiolhais, L Sousa - ACM Computing Surveys, 2023 - dl.acm.org
Computer architects employ a series of performance optimizations at the micro-architecture
level. These optimizations are meant to be invisible to the programmer but they are implicitly …

Dependence based prefetching for linked data structures

A Roth, A Moshovos, GS Sohi - … of the eighth international conference on …, 1998 - dl.acm.org
We introduce a dynamic scheme that captures the accesspat-terns of linked data structures
and can be used to predict future accesses with high accuracy. Our technique exploits the …

Speculative data-driven multithreading

A Roth, GS Sohi - Proceedings HPCA Seventh International …, 2001 - ieeexplore.ieee.org
Mispredicted branches and loads that miss in the cache cause the majority of retirement
stalls experienced by sequential processors; we call these critical instructions. Despite their …

JETTY: Filtering snoops for reduced energy consumption in SMP servers

A Moshovos, G Memik, B Falsafi… - … Symposium on High …, 2001 - ieeexplore.ieee.org
We propose methods for reducing the energy consumed by snoop requests in snoopy bus-
based symmetric multiprocessor (SMP) systems. Observing that a large fraction of snoops …

Decoupled software pipelining with the synchronization array

R Rangan, N Vachharajani… - … , 2004. PACT 2004., 2004 - ieeexplore.ieee.org
Despite the success of instruction-level parallelism (ILP) optimizations in increasing the
performance of microprocessors, certain codes remain elusive. In particular, codes …

Speculation techniques for improving load related instruction scheduling

A Yoaz, M Erez, R Ronen, S Jourdan - Proceedings of the 26th annual …, 1999 - dl.acm.org
State of the art microprocessors achieve high performance by executing multiple instructions
per cycle. In an out-of-order engine, the instruction scheduler is responsible for dispatching …

SpecuSym: Speculative symbolic execution for cache timing leak detection

S Guo, Y Chen, P Li, Y Cheng, H Wang, M Wu… - Proceedings of the ACM …, 2020 - dl.acm.org
CPU cache is a limited but crucial storage component in modern processors, whereas the
cache timing side-channel may inadvertently leak information through the physically …