TIE: Energy-efficient tensor train-based inference engine for deep neural network

C Deng, F Sun, X Qian, J Lin, Z Wang… - Proceedings of the 46th …, 2019 - dl.acm.org
In the era of artificial intelligence (AI), deep neural networks (DNNs) have emerged as the
most important and powerful AI technique. However, large DNN models are both storage …

ETTE: Efficient tensor-train-based computing engine for deep neural networks

Y Gong, M Yin, L Huang, J Xiao, Y Sui, C Deng… - Proceedings of the 50th …, 2023 - dl.acm.org
Tensor-train (TT) decomposition enables ultra-high compression ratio, making the deep
neural network (DNN) accelerators based on this method very attractive. TIE, the state-of-the …

Batude: Budget-aware neural network compression based on tucker decomposition

M Yin, H Phan, X Zang, S Liao, B Yuan - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Abstract Model compression is very important for the efficient deployment of deep neural
network (DNN) models on resource-constrained devices. Among various model …

Towards efficient tensor decomposition-based dnn model compression with optimization framework

M Yin, Y Sui, S Liao, B Yuan - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Advanced tensor decomposition, such as Tensor train (TT) and Tensor ring (TR), has been
widely studied for deep neural network (DNN) model compression, especially for recurrent …

Tensor Decomposition for Model Reduction in Neural Networks: A Review [Feature]

X Liu, KK Parhi - IEEE Circuits and Systems Magazine, 2023 - ieeexplore.ieee.org
Modern neural networks have revolutionized the fields of computer vision (CV) and Natural
Language Processing (NLP). They are widely used for solving complex CV tasks and NLP …

Bayesian tensorized neural networks with automatic rank selection

C Hawkins, Z Zhang - Neurocomputing, 2021 - Elsevier
Tensor decomposition is an effective approach to compress over-parameterized neural
networks and to enable their deployment on resource-constrained hardware platforms …

Hybrid tensor decomposition in neural network compression

B Wu, D Wang, G Zhao, L Deng, G Li - Neural Networks, 2020 - Elsevier
Deep neural networks (DNNs) have enabled impressive breakthroughs in various artificial
intelligence (AI) applications recently due to its capability of learning high-level features from …

Nonlinear tensor train format for deep neural network compression

D Wang, G Zhao, H Chen, Z Liu, L Deng, G Li - Neural Networks, 2021 - Elsevier
Deep neural network (DNN) compression has become a hot topic in the research of deep
learning since the scale of modern DNNs turns into too huge to implement on practical …

Deep convolutional neural network compression via coupled tensor decomposition

W Sun, S Chen, L Huang, HC So… - IEEE Journal of Selected …, 2020 - ieeexplore.ieee.org
Large neural networks have aroused impressive progress in various real world applications.
However, the expensive storage and computational resources requirement for running deep …

Compression of deep neural networks based on quantized tensor decomposition to implement on reconfigurable hardware platforms

A Nekooei, S Safari - Neural Networks, 2022 - Elsevier
Abstract Deep Neural Networks (DNNs) have been vastly and successfully employed in
various artificial intelligence and machine learning applications (eg, image processing and …