Sparsetrain: Leveraging dynamic sparsity in software for training dnns on general-purpose simd processors

Z Gong, H Ji, CW Fletcher, CJ Hughes… - Proceedings of the ACM …, 2020 - dl.acm.org
Our community has improved the efficiency of deep learning applications by exploiting
sparsity in inputs. Most of that work, though, is for inference, where weight sparsity is known …

Branch runahead: An alternative to branch prediction for impossible to predict branches

S Pruett, Y Patt - MICRO-54: 54th Annual IEEE/ACM International …, 2021 - dl.acm.org
High performance microprocessors require high levels of instruction supply. Branch
prediction has been the most important driver of this for nearly 30 years. Unfortunately …

Clairvoyance: Look-ahead compile-time scheduling

KA Tran, TE Carlson, K Koukos… - 2017 IEEE/ACM …, 2017 - ieeexplore.ieee.org
To enhance the performance of memory-bound applications, hardware designs have been
developed to hide memory latency, such as the out-of-order (OoO) execution engine, at the …

Clearing the shadows: Recovering lost performance for invisible speculative execution through hw/sw co-design

KA Tran, C Sakalis, M Själander, A Ros… - Proceedings of the …, 2020 - dl.acm.org
Out-of-order processors heavily rely on speculation to achieve high performance, allowing
instructions to bypass other slower instructions in order to fully utilize the processor's …

SWOOP: Software-hardware co-design for non-speculative, execute-ahead, in-order cores

KA Tran, A Jimborean, TE Carlson, K Koukos… - Proceedings of the 39th …, 2018 - dl.acm.org
Increasing demands for energy efficiency constrain emerging hardware. These new
hardware trends challenge the established assumptions in code generation and force us to …

[PDF][PDF] Using Convolutional Neural Networks to Improve Branch Prediction

SZ Kamali - 2022 - hps.ece.utexas.edu
My advisor, Prof. Yale Patt has been instrumental in the completion of my Ph. D. program.
Thank you for inspiring my interest in computer architecture, for convincing me to pursue my …

By-software branch prediction in loops

M Goudarzi, R Azimi, J Humecki… - IEEE Computer …, 2023 - ieeexplore.ieee.org
Load-Dependent Branches (LDB) often do not exhibit regular patterns in their local or global
history and thus are inherently hard to predict correctly by conventional branch predictors …

Static Branch Prediction through Representation Learning

P Alovisi - 2020 - diva-portal.org
In the context of compilers, branch probability prediction deals with estimating the probability
of a branch to be taken in a program. In the absence of profiling information, compilers rely …

[PDF][PDF] 一种先进的扁平化谓词及编译优化方法

王向前, 郑启龙, 张仁高, 韩东科 - 中国科学技术大学学报, 2019 - just.ustc.edu.cn
谓词执行是有效挖掘控制流程序指令级并行性的一种机制. 经典的谓词实现一般局部地逐个进行
谓词计算而不能进行多谓词控制, 有谓词计算路径过长等问题. 针对经典谓词存在的问题 …

Using convolutional neural networks to improve branch prediction

S Zangeneh Kamali - 2022 - repositories.lib.utexas.edu
The state-of-the-art branch predictor, TAGE, remains inefficient at identifying correlated
branches deep in a noisy global branch history. This dissertation argues this inefficiency is a …