SwitchingNet: Edge-Assisted Model Switching for Accurate Video Recognition Over Best-Effort Networks

F Beye, Y Babazaki, R Ando, T Oshiba… - 2024 IEEE 21st …, 2024 - ieeexplore.ieee.org
Despite the remarkable success of deep-learning in image and video recognition,
constructing real-time recognition systems for computationally intensive tasks such as spatio …

Tsm-mobilenetv3: A novel lightweight network model for video action recognition

S Zhang, Q Tong, Z Kong, H Lin - 2023 4th International …, 2023 - ieeexplore.ieee.org
The deployment of video action recognition models on mobile and embedded devices is
challenging due to the limited computational resources and storage capacity. To address …

MTRFN: Multiscale temporal receptive field network for compressed video action recognition at edge servers

L He, M Zhang, S Zhang, L Wang… - IEEE Internet of Things …, 2022 - ieeexplore.ieee.org
With the wide deployment of Internet of Things monitoring terminals, a tremendous number
of videos are accumulated continuously. Big data processing and analysis-based action …

Dynamic inference: A new approach toward efficient video action recognition

W Wu, D He, X Tan, S Chen… - Proceedings of the …, 2020 - openaccess.thecvf.com
Though action recognition in videos has achieved great success recently, it remains a
challenging task due to the massive computational cost. Designing lightweight networks is a …

Structural Reparameterization Lightweight Network for Video Action Recognition

A Zhu, Y Wang, W Li, P Qian - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
3D convolution networks play an important role in extracting spatiotemporal features in
video action recognition. However, it usually brings a large number of paramters, which …

Mimic the raw domain: Accelerating action recognition in the compressed domain

B Battash, H Barad, H Tang… - Proceedings of the …, 2020 - openaccess.thecvf.com
Video understanding usually requires expensive computation that prohibits its deployment,
yet videos contain significant spatiotemporal redundancy that can be exploited. In particular …

Physical knowledge driven multi-scale temporal receptive field network for compressed video action recognition

L He, M Zhang, S Zhang, F Li - Adjunct Proceedings of the 2021 ACM …, 2021 - dl.acm.org
Intelligent terminal based action recognition is important to smart cities. However, due to the
dependency on training data and high complexity of extracting information, the existing …

Dynamic normalization and relay for video action recognition

D Cai, A Yao, Y Chen - Advances in neural information …, 2021 - proceedings.neurips.cc
Abstract Convolutional Neural Networks (CNNs) have been the dominant model for video
action recognition. Due to the huge memory and compute demand, popular action …

Ean: event adaptive network for enhanced action recognition

Y Tian, Y Yan, G Zhai, G Guo, Z Gao - International Journal of Computer …, 2022 - Springer
Efficiently modeling spatial–temporal information in videos is crucial for action recognition.
To achieve this goal, state-of-the-art methods typically employ the convolution operator and …

Fastformer: transformer-based fast reasoning framework

W Zhu, L Guo, T Zhang, F Han, Y Wei… - … on Graphics and …, 2023 - spiedigitallibrary.org
Video action recognition is a vital task in the field of computer vision. A great deal of
redundant information is generated along with original video data in the process of depth …