Timekeeping in the memory system: predicting and optimizing memory behavior

Z Shi, X Huang, A Jain, C Lin - Proceedings of the 52nd Annual IEEE …, 2019 - dl.acm.org

Despite its success in many areas, deep learning is a poor fit for use in hardware predictors
because these models are impractically large and slow, but this paper shows how we can …

被引用次数：170 相关文章所有 4 个版本

[图书][B] Memory systems: cache, DRAM, disk

B Jacob, D Wang, S Ng - 2010 - books.google.com

Is your memory hierarchy stopping your microprocessor from performing at the high level it
should be? Memory Systems: Cache, DRAM, Disk shows you how to resolve this problem …

被引用次数：1198 相关文章所有 6 个版本

[PDF] ufpr.br

SHiP: Signature-based hit predictor for high performance caching

CJ Wu, A Jaleel, W Hasenplaugh, M Martonosi… - Proceedings of the 44th …, 2011 - dl.acm.org

The shared last-level caches in CMPs play an important role in improving application
performance and reducing off-chip memory bandwidth requirements. In order to use LLCs …

被引用次数：347 相关文章所有 15 个版本

[PDF] ucsd.edu

Bounded geometries, fractals, and low-distortion embeddings

A Gupta, R Krauthgamer, JR Lee - 44th Annual IEEE …, 2003 - ieeexplore.ieee.org

The doubling constant of a metric space (X, d) is the smallest value/spl lambda/such that
every ball in X can be covered by/spl lambda/balls of half the radius. The doubling …

被引用次数：532 相关文章所有 20 个版本

[PDF] researchgate.net

Technology comparison for large last-level caches (L³Cs): Low-leakage SRAM, low write-energy STT-RAM, and refresh-optimized eDRAM

MT Chang, P Rosenfeld, SL Lu… - 2013 IEEE 19th …, 2013 - ieeexplore.ieee.org

Large last-level caches (L 3 Cs) are frequently used to bridge the performance and power
gap between processor and memory. Although traditional processors implement caches as …

被引用次数：302 相关文章所有 11 个版本

[PDF] psu.edu

PIPP: Promotion/insertion pseudo-partitioning of multi-core shared caches

Y Xie, GH Loh - ACM SIGARCH Computer Architecture News, 2009 - dl.acm.org

Many multi-core processors employ a large last-level cache (LLC) shared among the
multiple cores. Past research has demonstrated that sharing-oblivious cache management …

被引用次数：432 相关文章所有 11 个版本

[PDF] psu.edu

Sampling dead block prediction for last-level caches

SM Khan, Y Tian, DA Jimenez - 2010 43rd Annual IEEE/ACM …, 2010 - ieeexplore.ieee.org

Last-level caches (LLCs) are large structures with significant power requirements. They can
be quite inefficient. On average, a cache block in a 2MB LRU-managed LLC is dead 86% of …

被引用次数：285 相关文章所有 13 个版本

[PDF] nsf.gov

Perceptron learning for reuse prediction

E Teran, Z Wang, DA Jiménez - 2016 49th Annual IEEE/ACM …, 2016 - ieeexplore.ieee.org

The disparity between last-level cache and memory latencies motivates the search for
efficient cache management policies. Recent work in predicting reuse of cache blocks …

被引用次数：157 相关文章所有 5 个版本

Counter-based cache replacement and bypassing algorithms

M Kharbutli, Y Solihin - IEEE Transactions on Computers, 2008 - ieeexplore.ieee.org

Recent studies have shown that, in highly associative caches, the performance gap between
the least recently used (LRU) and the theoretical optimal replacement algorithms is large …

被引用次数：294 相关文章所有 6 个版本

[PDF] kaist.ac.kr

Cache bursts: A new approach for eliminating dead blocks and increasing cache efficiency

H Liu, M Ferdman, J Huh… - 2008 41st IEEE/ACM …, 2008 - ieeexplore.ieee.org

Data caches in general-purpose microprocessors often contain mostly dead blocks and are
thus used inefficiently. To improve cache efficiency, dead blocks should be identified and …

被引用次数：260 相关文章所有 12 个版本

高级搜索

QQ 群