P6 binary floating-point unit

SB Park, S Mitra - Proceedings of the 45th annual Design Automation …, 2008 - dl.acm.org

The objective of IFRA, Instruction Footprint Recording and Analysis, is to overcome the
challenges associated with a very expensive step in post-silicon validation of processors …

被引用次数：373 相关文章所有 10 个版本

[PDF] uth.gr

Ibm power6 microarchitecture

HQ Le, WJ Starke, JS Fields… - IBM Journal of …, 2007 - ieeexplore.ieee.org

This paper describes the implementation of the IBM POWER6™ microprocessor, a two-way
simultaneous multithreaded (SMT) dual-core chip whose key features include binary …

被引用次数：371 相关文章所有 18 个版本

[PDF] stanford.edu

Energy-efficient floating-point unit design

S Galal, M Horowitz - IEEE Transactions on computers, 2010 - ieeexplore.ieee.org

Energy-efficient computation is critical if we are going to continue to scale performance in
power-limited systems. For floating-point applications that have large amounts of data …

被引用次数：206 相关文章所有 7 个版本

[PDF] psu.edu

Ibm power6 accelerators: Vmx and dfu

L Eisen, JW Ward, HW Tast, N Mading… - IBM Journal of …, 2007 - ieeexplore.ieee.org

The IBM POWER6™ microprocessor core includes two accelerators for increasing
performance of specific workloads. The vector multimedia extension (VMX) provides a vector …

被引用次数：148 相关文章所有 6 个版本

[PDF] jilp.org

Access map pattern matching for data cache prefetch

Y Ishii, M Inaba, K Hiraki - … of the 23rd international conference on …, 2009 - dl.acm.org

A novel data prefetching method--access map pattern matching (AMPM)--that uses" memory
access map" is proposed. The AMPM prefetching concentrate hardware resources on …

被引用次数：115 相关文章所有 5 个版本

[PDF] psu.edu

Floating-point division and square root using a Taylor-series expansion algorithm

TJ Kwon, J Draper - Microelectronics Journal, 2009 - Elsevier

Hardware support for floating-point (FP) arithmetic is a mandatory feature of modern
microprocessor design. Although division and square root are relatively infrequent …

被引用次数：74 相关文章所有 9 个版本

[PDF] googleapis.com

Chained split execution of fused compound arithmetic operations

T Elmer, NA Patil - US Patent 11,061,672, 2021 - Google Patents

A microprocessor is configured for unchained and chained modes of split execution of a
fused compound arithmetic operation. In both modes of split execution, a first execution unit …

被引用次数：48 相关文章所有 4 个版本

[PDF] psu.edu

Low-power leading-zero counting and anticipation logic for high-speed floating point units

G Dimitrakopoulos, K Galanopoulos… - IEEE transactions on …, 2008 - ieeexplore.ieee.org

In this paper, a new leading-zero counter (or detector) is presented. New boolean relations
for the bits of the leading-zero count are derived that allow their computation to be performed …

被引用次数：72 相关文章所有 9 个版本

[PDF] acsel-lab.com

Quad precision floating point on the IBM z13

C Lichtenau, S Carlough… - 2016 IEEE 23nd …, 2016 - ieeexplore.ieee.org

When operating on a rapidly increasing amount of data, business analytics applications
become sensitive to rounding errors, and profit from the higher stability and faster …

被引用次数：28 相关文章所有 3 个版本

Area efficient and fast combined binary/decimal floating point fused multiply add unit

AA Wahba, HAH Fahmy - IEEE Transactions on Computers, 2016 - ieeexplore.ieee.org

In this work we present a new 64-bit floating point Fused Multiply Add (FMA) unit that can
perform both binary and decimal addition, multiplication, and fused-multiply-add operations …

被引用次数：31 相关文章所有 3 个版本

高级搜索

QQ 群

IFRA: Instruction footprint recording and analysis for post-silicon bug localization in processors

Ibm power6 microarchitecture

Energy-efficient floating-point unit design

Ibm power6 accelerators: Vmx and dfu

Access map pattern matching for data cache prefetch

Floating-point division and square root using a Taylor-series expansion algorithm

Chained split execution of fused compound arithmetic operations

Low-power leading-zero counting and anticipation logic for high-speed floating point units

Quad precision floating point on the IBM z13

Area efficient and fast combined binary/decimal floating point fused multiply add unit

引用