- 学术资源搜索

Analysis of the hands in egocentric vision: A survey

A Bandini, J Zariffa - IEEE transactions on pattern analysis and …, 2020 - ieeexplore.ieee.org

Egocentric vision (aka first-person vision–FPV) applications have thrived over the past few
years, thanks to the availability of affordable wearable cameras and large annotated …

被引用次数：108 相关文章所有 8 个版本

[PDF] openreview.net

Animal pose estimation: A closer look at the state-of-the-art, existing gaps and opportunities

L Jiang, C Lee, D Teotia, S Ostadabbas - Computer Vision and Image …, 2022 - Elsevier

Over the past few years, research on animal pose estimation in computer vision field has
grown in many aspects such as 2D and 3D pose estimation, 3D mesh reconstruction, and …

被引用次数：39 相关文章所有 2 个版本

[PDF] thecvf.com

Clean-label backdoor attacks on video recognition models

S Zhao, X Ma, X Zheng, J Bailey… - Proceedings of the …, 2020 - openaccess.thecvf.com

Deep neural networks (DNNs) are vulnerable to backdoor attacks which can hide backdoor
triggers in DNNs by poisoning training data. A backdoored model behaves normally on …

被引用次数：335 相关文章所有 8 个版本

[PDF] thecvf.com

Ava: A video dataset of spatio-temporally localized atomic visual actions

C Gu, C Sun, DA Ross, C Vondrick… - Proceedings of the …, 2018 - openaccess.thecvf.com

This paper introduces a video dataset of spatio-temporally localized Atomic Visual Actions
(AVA). The AVA dataset densely annotates 80 atomic visual actions in 437 15-minute video …

被引用次数：1264 相关文章所有 20 个版本

[PDF] thecvf.com

Cola: Weakly-supervised temporal action localization with snippet contrastive learning

C Zhang, M Cao, D Yang, J Chen… - Proceedings of the …, 2021 - openaccess.thecvf.com

Weakly-supervised temporal action localization (WS-TAL) aims to localize actions in
untrimmed videos with only video-level labels. Most existing models follow the" localization …

被引用次数：173 相关文章所有 7 个版本

Spatio-temporal autoencoder for video anomaly detection

Y Zhao, B Deng, C Shen, Y Liu, H Lu… - Proceedings of the 25th …, 2017 - dl.acm.org

Anomalous events detection in real-world video scenes is a challenging problem due to the
complexity of" anomaly" as well as the cluttered backgrounds, objects and motions in the …

被引用次数：612 相关文章所有 2 个版本

[PDF] thecvf.com

Deep video deblurring for hand-held cameras

S Su, M Delbracio, J Wang, G Sapiro… - Proceedings of the …, 2017 - openaccess.thecvf.com

Motion blur from camera shake is a major problem in videos captured by hand-held devices.
Unlike single-image deblurring, video-based approaches can take advantage of the …

被引用次数：686 相关文章所有 12 个版本

[PDF] thecvf.com

Real-time action recognition with enhanced motion vector CNNs

B Zhang, L Wang, Z Wang, Y Qiao… - Proceedings of the …, 2016 - openaccess.thecvf.com

The deep two-stream architecture exhibited excellent performance on video based action
recognition. The most computationally expensive step in this approach comes from the …

被引用次数：535 相关文章所有 13 个版本

[PDF] ecva.net

Efficient spatio-temporal recurrent neural network for video deblurring

Z Zhong, Y Gao, Y Zheng, B Zheng - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer

Real-time video deblurring still remains a challenging task due to the complexity of spatially
and temporally varying blur itself and the requirement of low computational cost. To improve …

被引用次数：166 相关文章所有 4 个版本

[PDF] thecvf.com

Controllable video captioning with pos sequence guidance based on gated fusion network

B Wang, L Ma, W Zhang, W Jiang… - Proceedings of the …, 2019 - openaccess.thecvf.com

In this paper, we propose to guide the video caption generation with Part-of-Speech (POS)
information, based on a gated fusion of multiple representations of input videos. We …

被引用次数：225 相关文章所有 5 个版本

高级搜索

QQ 群

Analysis of the hands in egocentric vision: A survey

Animal pose estimation: A closer look at the state-of-the-art, existing gaps and opportunities

Clean-label backdoor attacks on video recognition models

Ava: A video dataset of spatio-temporally localized atomic visual actions

Cola: Weakly-supervised temporal action localization with snippet contrastive learning

Spatio-temporal autoencoder for video anomaly detection

Deep video deblurring for hand-held cameras

Real-time action recognition with enhanced motion vector CNNs

Efficient spatio-temporal recurrent neural network for video deblurring

Controllable video captioning with pos sequence guidance based on gated fusion network

引用