Concurrency extraction via hardware methods executing the static instruction stream

P Marcuello, A González, J Tubella - Proceedings of the 12th …, 1998 - dl.acm.org

In this paper we present a novel processor microarchitecture that relieves, four of the most
important bottlenecks of superscalar processors: the serialization imposed by true …

被引用次数：228 相关文章所有 12 个版本

[PDF] researchgate.net

Disjoint eager execution: An optimal form of speculative execution

AK Uht, V Sindagi, K Hall - Proceedings of the 28th annual …, 1995 - ieeexplore.ieee.org

Instruction Level Parallelism (ILP) speedups of an order-of-magnitude or greater may be
possible using the techniques described herein. Traditional speculative code execution is …

被引用次数：172 相关文章所有 14 个版本

[PDF] acm.org

Improving superscalar instruction dispatch and issue by exploiting dynamic code sequences

S Vajapeyam, T Mitra - ACM SIGARCH Computer Architecture News, 1997 - dl.acm.org

Superscalar processors currently have the potential to fetch multiple basic blocks per cycle
by employing one of several recently proposed instruction fetch mechanisms. However, this …

被引用次数：155 相关文章所有 19 个版本

[PDF] upc.edu

Speculative dynamic vectorization

A Pajuelo, A González, M Valero - ACM SIGARCH Computer …, 2002 - dl.acm.org

Traditional vector architectures have shown to be very effective for regular codes where the
compiler can detect data-level parallelism. However, this SIMD parallelism is also present in …

被引用次数：62 相关文章所有 22 个版本

[PDF] acm.org

Dynamic vectorization: a mechanism for exploiting far-flung ILP in ordinary programs

S Vajapeyam, PJ Joseph, T Mitra - ACM SIGARCH Computer …, 1999 - dl.acm.org

Several ILP limit studies indicate the presence of considerable ILP across dynamically far-
apart instructions in program execution. This paper proposes a hardware mechanism …

被引用次数：36 相关文章所有 18 个版本

[PDF] psu.edu

Exploiting speculative thread-level parallelism on a SMT processor

P Marcuello, A González - International Conference on High-Performance …, 1999 - Springer

In this paper we present a run-time mechanism to simultaneously execute multiple threads
from a sequential program on a simultaneous multithreaded (SMT) processor. The threads …

被引用次数：29 相关文章所有 9 个版本

A theory of reduced and minimal procedural dependencies

AK Uht - IEEE Transactions on Computers, 1991 - computer.org

A reduced set of procedural dependencies is presented which is necessary and sufficient to
describe all procedural dependencies in standard imperative codes. Hence, the set is …

被引用次数：34 相关文章所有 5 个版本

[PDF] psu.edu

[图书][B] Hardware and software for functional and fine-grain parallelism

CJ Beckmann - 1993 - search.proquest.com

This thesis examines nonloop parallelism at both fine and coarse levels of granularity in
numerical FORTRAN programs. Measurements of the extent of this functional parallelism in …

被引用次数：34 相关文章所有 8 个版本

[PDF] jilp.org

[PDF][PDF] Levo-a scalable processor with high IPC

AK Uht, D Morano, A Khalafi, DR Kaeli - Journal of Instruction-Level …, 2003 - jilp.org

The Levo high IPC microarchitecture is described and evaluated. Levo employs instruction
time-tags and Active Stations to ensure correct operation in a rampantly speculative and out …

被引用次数：21 相关文章所有 9 个版本

[PDF] acm.org

Extraction of massive instruction level parallelism

AK Uht - ACM SIGARCH Computer Architecture News, 1993 - dl.acm.org

Our goal is to dramatically increase the performance of uniprocessors through the
exploitation of instruction level parallelism, ie that parallelism which exists amongst the …

被引用次数：19 相关文章所有 7 个版本

高级搜索

QQ 群