Miriam: Exploiting elastic kernels for real-time multi-dnn inference on edge gpu

Z Zhao, N Ling, N Guan, G Xing - … of the 21st ACM Conference on …, 2023 - dl.acm.org
Many applications such as autonomous driving and augmented reality, require the
concurrent running of multiple deep neural networks (DNN) that poses different levels of real …

Fasor: A Fast Tensor Program Optimization Framework for Efficient DNN Deployment

H Huang, X Chen, J Zhao - Proceedings of the 38th ACM International …, 2024 - dl.acm.org
With the growing importance of deploying deep neural networks (DNNs), there are
increasing demands to improve both the efficiency and quality of tensor program …

[图书][B] Analysing and Reducing Costs of Deep Learning Compiler Auto-tuning

D Borowiec - 2023 - search.proquest.com
Deep Learning (DL) is significantly impacting many industries, including automotive, retail
and medicine, enabling autonomous driving, recommender systems and genomics …