Analysis of the hands in egocentric vision: A survey

A Bandini, J Zariffa - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org
Egocentric vision (aka first-person vision–FPV) applications have thrived over the past few
years, thanks to the availability of affordable wearable cameras and large annotated …

Animal pose estimation: A closer look at the state-of-the-art, existing gaps and opportunities

L Jiang, C Lee, D Teotia, S Ostadabbas - Computer Vision and Image …, 2022 - Elsevier
Over the past few years, research on animal pose estimation in computer vision field has
grown in many aspects such as 2D and 3D pose estimation, 3D mesh reconstruction, and …

Clean-label backdoor attacks on video recognition models

S Zhao, X Ma, X Zheng, J Bailey… - Proceedings of the …, 2020 - openaccess.thecvf.com
Deep neural networks (DNNs) are vulnerable to backdoor attacks which can hide backdoor
triggers in DNNs by poisoning training data. A backdoored model behaves normally on …

AVA: A video dataset of spatio-temporally localized atomic visual actions

C Gu, C Sun, DA Ross, C Vondrick… - Proceedings of the …, 2018 - openaccess.thecvf.com
This paper introduces a video dataset of spatio-temporally localized Atomic Visual Actions
(AVA). The AVA dataset densely annotates 80 atomic visual actions in 437 15-minute video …

CoLA: Weakly-supervised temporal action localization with snippet contrastive learning

C Zhang, M Cao, D Yang, J Chen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Weakly-supervised temporal action localization (WS-TAL) aims to localize actions in
untrimmed videos with only video-level labels. Most existing models follow the "localization …

Spatio-temporal autoencoder for video anomaly detection

Y Zhao, B Deng, C Shen, Y Liu, H Lu… - Proceedings of the 25th …, 2017 - dl.acm.org
Anomalous events detection in real-world video scenes is a challenging problem due to the
complexity of "anomaly" as well as the cluttered backgrounds, objects and motions in the …

Deep video deblurring for hand-held cameras

S Su, M Delbracio, J Wang, G Sapiro… - Proceedings of the …, 2017 - openaccess.thecvf.com
Motion blur from camera shake is a major problem in videos captured by hand-held devices.
Unlike single-image deblurring, video-based approaches can take advantage of the …

Real-time action recognition with enhanced motion vector CNNs

B Zhang, L Wang, Z Wang, Y Qiao… - Proceedings of the …, 2016 - openaccess.thecvf.com
The deep two-stream architecture exhibited excellent performance on video based action
recognition. The most computationally expensive step in this approach comes from the …

Efficient spatio-temporal recurrent neural network for video deblurring

Z Zhong, Y Gao, Y Zheng, B Zheng - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Real-time video deblurring still remains a challenging task due to the complexity of spatially
and temporally varying blur itself and the requirement of low computational cost. To improve …

Controllable video captioning with POS sequence guidance based on gated fusion network

B Wang, L Ma, W Zhang, W Jiang… - Proceedings of the …, 2019 - openaccess.thecvf.com
In this paper, we propose to guide the video caption generation with Part-of-Speech (POS)
information, based on a gated fusion of multiple representations of input videos. We …