Dynamic perceiver for efficient visual recognition

Y Han, D Han, Z Liu, Y Wang, X Pan… - Proceedings of the …, 2023 - openaccess.thecvf.com
Early exiting has become a promising approach to im-proving the inference efficiency of
deep networks. By structuring models with multiple classifiers (exits), predictions for" easy" …

Efficienttrain: Exploring generalized curriculum learning for training visual backbones

Y Wang, Y Yue, R Lu, T Liu, Z Zhong… - Proceedings of the …, 2023 - openaccess.thecvf.com
The superior performance of modern deep networks usually comes with a costly training
procedure. This paper presents a new curriculum learning approach for the efficient training …

Borrowing knowledge from pre-trained language model: A new data-efficient visual learning paradigm

W Ma, S Li, JM Zhang, CH Liu, J Kang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The development of vision models for real-world applications is hindered by the challenge of
annotated data scarcity, which has necessitated the adoption of data-efficient visual learning …

Action recognition in compressed domains: A survey

Y Ming, J Zhou, N Hu, F Feng, P Zhao, B Lyu, H Yu - Neurocomputing, 2024 - Elsevier
Human action recognition (HAR) refers to the process in which computers analyze and
process video data to obtain the categories of action presented in the video. It has a wide …

Computation-efficient deep learning for computer vision: A survey

Y Wang, Y Han, C Wang, S Song… - Cybernetics and …, 2024 - ieeexplore.ieee.org
Over the past decade, deep learning models have exhibited considerable advancements,
reaching or even exceeding human-level performance in a range of visual perception tasks …

Scalable frame resolution for efficient continuous sign language recognition

L Hu, L Gao, Z Liu, W Feng - Pattern Recognition, 2024 - Elsevier
In this paper, we explore the spatial redundancy in continuous sign language recognition
(CSLR), aiming to improve its efficiency. Despite recent advances in accuracy in CSLR, state …

Causal Inference Meets Deep Learning: A Comprehensive Survey

L Jiao, Y Wang, X Liu, L Li, F Liu, W Ma, Y Guo, P Chen… - Research, 2024 - spj.science.org
Deep learning relies on learning from extensive data to generate prediction results. This
approach may inadvertently capture spurious correlations within the data, leading to models …

Compressed video action recognition with dual-stream and dual-modal transformer

Y Mou, X Jiang, K Xu, T Sun… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Compressed video action recognition offers the advantage of reducing decoding and
inference time compared to the RGB domain. However, the compressed domain poses …

CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement

Q Zhu, J Hao, Y Ding, Y Liu, Q Mo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently numerous approaches have achieved notable success in compressed video
quality enhancement (VQE). However these methods usually ignore the utilization of …

A veracity dissemination consistency-based few-shot fake news detection framework by synergizing adversarial and contrastive self-supervised learning

W Jin, N Wang, T Tao, B Shi, H Bi, B Zhao, H Wu… - Scientific Reports, 2024 - nature.com
With the rapid growth of social media, fake news (rumors) are rampant online, seriously
endangering the health of mainstream social consciousness. Fake news detection (FEND) …