GreenMM: energy efficient GPU matrix multiplication through undervolting

H Zamani, Y Liu, D Tripathy, L Bhuyan… - Proceedings of the ACM …, 2019 - dl.acm.org
The current trend of ever-increasing performance in scientific applications comes with
tremendous growth in energy consumption. In this paper, we present GreenMM framework …

SAOU: safe adaptive overclocking and undervolting for energy-efficient GPU computing

H Zamani, D Tripathy, L Bhuyan, Z Chen - Proceedings of the ACM/IEEE …, 2020 - dl.acm.org
The current trend of ever-increasing performance in scientific applications comes with
tremendous growth in energy consumption. In this paper, we present a framework for GPU …

Customizing the HPL for China accelerator

X Gan, Y Hu, J Liu, L Chi, H Xu, C Gong, S Li… - Science China …, 2018 - Springer
HPL is a Linpack benchmark package widely used in high-performance computing tests.
Customizing the HPL is crucial for a heterogeneous system equipped with CPU and the …

[BOOK][B] Improving Energy Efficiency of Basic Linear Algebra Routines on Heterogeneous Systems With Multiple GPUs

HZ Sabzi - 2023 - search.proquest.com
The current trend of ever-increasing performance in high performance computing (HPC)
applications comes with tremendous growth in energy consumption. Because existing …

Parallel matrix multiplication for various implementations

N Taghiyev, M Akcay - 2013 7th International Conference on …, 2013 - ieeexplore.ieee.org
It has become increasingly common to see that supercomputing applications harness the
massive parallelism of graphics cards to speed up computations. In this study, an analysis …

Computation-communication overlap of linpack on a GPU-accelerated PC cluster

J Ohmura, T Miyoshi, H Irie… - IEICE TRANSACTIONS on …, 2011 - search.ieice.org
In this paper, we propose an approach to obtaining enhanced performance of the Linpack
benchmark on a GPU-accelerated PC cluster connected via relatively slow inter-node …

Improving Energy Efficiency of Basic Linear Algebra Routines on Heterogeneous Systems With Multiple GPUs

H Zamani Sabzi - 2022 - escholarship.org
The current trend of ever-increasing performance in high performance computing (HPC)
applications comes with tremendous growth in energy consumption. Because existing …

A study of a GPU programming framework that allows embedding MPI

T Miyoshi, M Kondo, H Irie, T Yoshinaga… - Advanced Computing Infrastructure …, 2011 - ipsj.ixsq.nii.ac.jp
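Abstract: To run a desired computation efficiently on a cluster computer made up of multiple
GPU-equipped compute nodes, work must be assigned efficiently across the GPUs. Multiple nodes within the cluster computer …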

[CITATION][C] Master Thesis Proposal

AB Andersen, R Pagh