A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

A survey of recent advances in edge-computing-powered artificial intelligence of things

Z Chang, S Liu, X Xiong, Z Cai… - IEEE Internet of Things …, 2021 - ieeexplore.ieee.org
The Internet of Things (IoT) has created a ubiquitously connected world powered by a
multitude of wired and wireless sensors generating a variety of heterogeneous data over …

Efficientformer: Vision transformers at mobilenet speed

Y Li, G Yuan, Y Wen, J Hu… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Vision Transformers (ViT) have shown rapid progress in computer vision tasks,
achieving promising results on various benchmarks. However, due to the massive number of …

Convolutional neural network pruning with structural redundancy reduction

Z Wang, C Li, X Wang - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Convolutional neural network (CNN) pruning has become one of the most successful
network compression approaches in recent years. Existing works on network pruning …

Demystifying parallel and distributed deep learning: An in-depth concurrency analysis

T Ben-Nun, T Hoefler - ACM Computing Surveys (CSUR), 2019 - dl.acm.org
Deep Neural Networks (DNNs) are becoming an important tool in modern computing
applications. Accelerating their training is a major challenge and techniques range from …

[PDF][PDF] 卷积神经网络结构优化综述

林景栋, 吴欣怡, 柴毅, 尹宏鹏 - 自动化学报, 2020 - aas.net.cn
摘要近年来, 卷积神经网络(Convolutional neural network, CNNs) 在计算机视觉,
自然语言处理, 语音识别等领域取得了突飞猛进的发展, 其强大的特征学习能力引起了国内外 …

Real-time neural light field on mobile devices

J Cao, H Wang, P Chemerys… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Recent efforts in Neural Rendering Fields (NeRF) have shown impressive results
on novel view synthesis by utilizing implicit neural representation to represent 3D scenes …

A Survey of Design and Optimization for Systolic Array-based DNN Accelerators

R Xu, S Ma, Y Guo, D Li - ACM Computing Surveys, 2023 - dl.acm.org
In recent years, it has been witnessed that the systolic array is a successful architecture for
DNN hardware accelerators. However, the design of systolic arrays also encountered many …

Deep learning on mobile and embedded devices: State-of-the-art, challenges, and future directions

Y Chen, B Zheng, Z Zhang, Q Wang, C Shen… - ACM Computing …, 2020 - dl.acm.org
Recent years have witnessed an exponential increase in the use of mobile and embedded
devices. With the great success of deep learning in many fields, there is an emerging trend …

Dual-side sparse tensor core

Y Wang, C Zhang, Z Xie, C Guo, Y Liu… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Leveraging sparsity in deep neural network (DNN) models is promising for accelerating
model inference. Yet existing GPUs can only leverage the sparsity from weights but not …