Performance aware convolutional neural network channel pruning for embedded GPUs

V Radu, K Kaszyk, Y Wen, J Turner… - 2019 IEEE …, 2019 - ieeexplore.ieee.org
Convolutional Neural Networks (CNN) are becoming a common presence in many
applications and services, due to their superior recognition accuracy. They are increasingly …

Efficient auto-tuning of parallel programs with interdependent tuning parameters via auto-tuning framework (ATF)

A Rasch, R Schulze, M Steuwer… - ACM Transactions on …, 2021 - dl.acm.org
Auto-tuning is a popular approach to program optimization: it automatically finds good
configurations of a program's so-called tuning parameters whose values are crucial for …

End-to-end characterization of game streaming applications on mobile platforms

S Bhuyan, S Zhao, Z Ying, MT Kandemir… - Proceedings of the ACM …, 2022 - dl.acm.org
With the advent of 5G, supporting high-quality game streaming applications on edge devices
has become a reality. This is evidenced by a recent surge in cloud gaming applications on …

Horus: A modular GPU emulator framework

AS Elhelw, S Pai - … on Performance Analysis of Systems and …, 2020 - ieeexplore.ieee.org
Graphics Processing Units (GPUs) are widely used to run general-purpose computing
workloads. Three approaches currently exist to observe the dynamic behaviour of these …

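Efficient real-time pipeline data stream transmission and processing technology applied to radio astronomy

张萌, 张海龙, 王杰, 李健, 冶鑫晨, 王万琼, 李嘉… - 2021 - html.rhhz.net
Addressing the problem of real-time, efficient transmission and processing of massive astronomical
signals in ultra-wideband and multi-beam receiving systems, for the Field Programmable Gate Array (FPGA) + Graphics Processing Unit (GPU) based …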

Simulation methodologies for mobile GPUs

K Kaszyk - 2022 - era.ed.ac.uk
GPUs critically rely on a complex system software stack comprising kernel- and user-space
drivers and JIT compilers. Yet, existing GPU simulators typically abstract away details of the …

Efficient Real-time Data Transmission and Processing Technologies Applied to Radio Astronomy

Z Meng, Z Hailong, W Jie, L Jian, Y Xinchen… - Astronomical …, 2021 - ati.ac.cn
Aiming at problems of massive signals real-time transmission and processing in ultra-
wideband and multi-beam receiving systems, we tested and analyzed the related software of …

UK Systems Research Challenges Workshop: Fast, Unmodified, Full-system Mobile CPU/GPU Simulation

T Spink - uksystems.org
Graphics Processing Units (GPUs) have seen a lot of attention over the past decade, and in
particular have grown to support workloads that are not strictly graphical in nature. Their …

Full-System GPU Design Space Exploration

K Kaszyk, B Franke - Workshop on Modeling & Simulation of …, 2020 - research.ed.ac.uk
The prevalence of machine learning in recent times has had a dramatic impact on the way
we design and use mobile and IoT systems. With growing concerns for privacy and quality of …