Domain-specialized cache management for graph analytics

P Faldu, J Diamond, B Grot - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Graph analytics power a range of applications in areas as diverse as finance, networking
and business logistics. A common property of graphs used in the domain of graph analytics …

[图书][B] Memory hierarchy design for stream computing

NS Jayasena - 2005 - search.proquest.com
Several classes of applications with abundant fine-grain parallelism, such as media and
signal processing, graphics, and scientific computing, have become increasingly dominant …

Efficient validation of coherency between processor cores and accelerators in computer systems

M Dusanapudi, S Kamaraju, S Kapoor - US Patent 9,501,408, 2016 - Google Patents
(57) ABSTRACT A method of testing cache coherency in a computer system design
allocates different portions of a single cache line for use by accelerators and processors …

Software-controlled cache architecture for energy efficiency

CL Yang, HW Tseng, CC Ho… - IEEE Transactions on …, 2005 - ieeexplore.ieee.org
Power consumption is an important design issue of current multimedia embedded systems.
Data caches consume a significant portion of total processor power for multimedia …

Efficient validation of coherency between processor cores and accelerators in computer systems

M Dusanapudi, S Kamaraju, S Kapoor - US Patent App. 14/038,125, 2014 - Google Patents
(57) ABSTRACT A method of testing cache coherency in a computer system design
allocates different portions of a single cache line for use by accelerators and processors …

[图书][B] Certified run-time code generation

FM Smith - 2002 - search.proquest.com
Run-time code generation (RTCG) has been shown to be an effective optimization. Systems
such as DyC,'C, Tempo, and Fabius have demonstrated order of magnitude improvements …

Cache streamization for high performance stream processor

N Wu, M Wen, J Ren, Y He, CQ Xun… - … Conference on High …, 2009 - ieeexplore.ieee.org
Due to high bandwidth demand on memory system of stream applications, most of stream
processors use software-managed streaming memory. However, this memory …

Addressing variability in reuse prediction for last-level caches

P Faldu - arXiv preprint arXiv:2006.08487, 2020 - arxiv.org
Last-Level Cache (LLC) represents the bulk of a modern CPU processor's transistor budget
and is essential for application performance as LLC enables fast access to data in contrast …

[引用][C] Low Overhead Helper Threads for Software-Based Techniques

GK Dorai, D Yeung