[PDF][PDF] Speculative multithreaded processors

P Marcuello, A González, J Tubella - Proceedings of the 12th …, 1998 - dl.acm.org
In this paper we present a novel processor microarchitecture that relieves, four of the most
important bottlenecks of superscalar processors: the serialization imposed by true …

Disjoint eager execution: An optimal form of speculative execution

AK Uht, V Sindagi, K Hall - Proceedings of the 28th annual …, 1995 - ieeexplore.ieee.org
Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be
possible using the techniques described herein. Traditional speculative code execution is …

Improving superscalar instruction dispatch and issue by exploiting dynamic code sequences

S Vajapeyam, T Mitra - ACM SIGARCH Computer Architecture News, 1997 - dl.acm.org
Superscalar processors currently have the potential to fetch multiple basic blocks per cycle
by employing one of several recently proposed instruction fetch mechanisms. However, this …

Speculative dynamic vectorization

A Pajuelo, A González, M Valero - ACM SIGARCH Computer …, 2002 - dl.acm.org
Traditional vector architectures have shown to be very effective for regular codes where the
compiler can detect data-level parallelism. However, this SIMD parallelism is also present in …

Dynamic vectorization: a mechanism for exploiting far-flung ILP in ordinary programs

S Vajapeyam, PJ Joseph, T Mitra - ACM SIGARCH Computer …, 1999 - dl.acm.org
Several ILP limit studies indicate the presence of considerable ILP across dynamically far-
apart instructions in program execution. This paper proposes a hardware mechanism …

Exploiting speculative thread-level parallelism on a SMT processor

P Marcuello, A González - International Conference on High-Performance …, 1999 - Springer
In this paper we present a run-time mechanism to simultaneously execute multiple threads
from a sequential program on a simultaneous multithreaded (SMT) processor. The threads …

A theory of reduced and minimal procedural dependencies

AK Uht - IEEE Transactions on Computers, 1991 - computer.org
A reduced set of procedural dependencies is presented which is necessary and sufficient to
describe all procedural dependencies in standard imperative codes. Hence, the set is …

[图书][B] Hardware and software for functional and fine-grain parallelism

CJ Beckmann - 1993 - search.proquest.com
This thesis examines nonloop parallelism at both fine and coarse levels of granularity in
numerical FORTRAN programs. Measurements of the extent of this functional parallelism in …

[PDF][PDF] Levo-a scalable processor with high IPC

AK Uht, D Morano, A Khalafi, DR Kaeli - Journal of Instruction-Level …, 2003 - jilp.org
The Levo high IPC microarchitecture is described and evaluated. Levo employs instruction
time-tags and Active Stations to ensure correct operation in a rampantly speculative and out …

Extraction of massive instruction level parallelism

AK Uht - ACM SIGARCH Computer Architecture News, 1993 - dl.acm.org
Our goal is to dramatically increase the performance of uniprocessors through the
exploitation of instruction level parallelism, ie that parallelism which exists amongst the …