Accelerating sparse dnns based on tiled gemm

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

Accelerating sparse dnns based on tiled gemm

在引用文章中搜索

[PDF] acm.org

Fractal: Joint Multi-Level Sparse Pattern Tuning of Accuracy and Performance for DNN Pruning

Y Guan, C Yu, Y Zhou, J Leng, C Li, M Guo - Proceedings of the 29th …, 2024 - dl.acm.org

Model pruning, which eliminates redundant parameters and reduces computational
complexity, emerges as a viable strategy for efficient deep neural network (DNN) …

被引用次数：1 相关文章

[PDF] arxiv.org

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

J Xu, R Zhang, C Guo, W Hu, Z Liu, F Wu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models (LLMs) are widely used across various domains, processing
millions of daily requests. This surge in demand poses significant challenges in optimizing …

Scaling Analog Photonic Accelerators for Byte-Size, Integer General Matrix Multiply (GEMM) Kernels

OA Alo, SS Vatsavai, I Thakkar - arXiv preprint arXiv:2407.06134, 2024 - arxiv.org

Deep Neural Networks (DNNs) predominantly rely on General Matrix Multiply (GEMM)
kernels, which are often accelerated using specialized hardware architectures. Recently …

高级搜索

QQ 群

Accelerating sparse dnns based on tiled gemm

Fractal: Joint Multi-Level Sparse Pattern Tuning of Accuracy and Performance for DNN Pruning

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

Scaling Analog Photonic Accelerators for Byte-Size, Integer General Matrix Multiply (GEMM) Kernels

引用