IFRA: Instruction footprint recording and analysis for post-silicon bug localization in processors

SB Park, S Mitra - Proceedings of the 45th annual Design Automation …, 2008 - dl.acm.org
The objective of IFRA, Instruction Footprint Recording and Analysis, is to overcome the
challenges associated with a very expensive step in post-silicon validation of processors …

Ibm power6 microarchitecture

HQ Le, WJ Starke, JS Fields… - IBM Journal of …, 2007 - ieeexplore.ieee.org
This paper describes the implementation of the IBM POWER6™ microprocessor, a two-way
simultaneous multithreaded (SMT) dual-core chip whose key features include binary …

Energy-efficient floating-point unit design

S Galal, M Horowitz - IEEE Transactions on computers, 2010 - ieeexplore.ieee.org
Energy-efficient computation is critical if we are going to continue to scale performance in
power-limited systems. For floating-point applications that have large amounts of data …

Ibm power6 accelerators: Vmx and dfu

L Eisen, JW Ward, HW Tast, N Mading… - IBM Journal of …, 2007 - ieeexplore.ieee.org
The IBM POWER6™ microprocessor core includes two accelerators for increasing
performance of specific workloads. The vector multimedia extension (VMX) provides a vector …

Access map pattern matching for data cache prefetch

Y Ishii, M Inaba, K Hiraki - … of the 23rd international conference on …, 2009 - dl.acm.org
A novel data prefetching method--access map pattern matching (AMPM)--that uses" memory
access map" is proposed. The AMPM prefetching concentrate hardware resources on …

Floating-point division and square root using a Taylor-series expansion algorithm

TJ Kwon, J Draper - Microelectronics Journal, 2009 - Elsevier
Hardware support for floating-point (FP) arithmetic is a mandatory feature of modern
microprocessor design. Although division and square root are relatively infrequent …

Chained split execution of fused compound arithmetic operations

T Elmer, NA Patil - US Patent 11,061,672, 2021 - Google Patents
A microprocessor is configured for unchained and chained modes of split execution of a
fused compound arithmetic operation. In both modes of split execution, a first execution unit …

Low-power leading-zero counting and anticipation logic for high-speed floating point units

G Dimitrakopoulos, K Galanopoulos… - IEEE transactions on …, 2008 - ieeexplore.ieee.org
In this paper, a new leading-zero counter (or detector) is presented. New boolean relations
for the bits of the leading-zero count are derived that allow their computation to be performed …

Quad precision floating point on the IBM z13

C Lichtenau, S Carlough… - 2016 IEEE 23nd …, 2016 - ieeexplore.ieee.org
When operating on a rapidly increasing amount of data, business analytics applications
become sensitive to rounding errors, and profit from the higher stability and faster …

Area efficient and fast combined binary/decimal floating point fused multiply add unit

AA Wahba, HAH Fahmy - IEEE Transactions on Computers, 2016 - ieeexplore.ieee.org
In this work we present a new 64-bit floating point Fused Multiply Add (FMA) unit that can
perform both binary and decimal addition, multiplication, and fused-multiply-add operations …