查看文章

thecvf.com 中的 [PDF]

Counting out time: Class agnostic video repetition counting in the wild

作者

Debidatta Dwibedi, Yusuf Aytar, Jonathan Tompson, Pierre Sermanet, Andrew Zisserman

发表日期

2020

研讨会论文

Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

页码范围

10387-10396

简介

We present an approach for estimating the period with which an action is repeated in a video. The crux of the approach lies in constraining the period prediction module to use temporal self-similarity as an intermediate representation bottleneck that allows generalization to unseen repetitions in videos in the wild. We train this model, called RepNet, with a synthetic dataset that is generated from a large unlabeled video collection by sampling short clips of varying lengths and repeating them with different periods and counts. This combination of synthetic data and a powerful yet constrained model, allows us to predict periods in a class-agnostic fashion. Our model substantially exceeds the state of the art performance on existing periodicity (PERTUBE) and repetition counting (QUVA) benchmarks. We also collect a new challenging dataset called Countix (90 times larger than existing datasets) which captures the challenges of repetition counting in real-world videos. Project webpage: https://sites. google. com/view/repnet.

引用总数

被引用次数：117

202020212022202320241 22 29 41 24

学术搜索中的文章

Counting out time: Class agnostic video repetition counting in the wild

D Dwibedi, Y Aytar, J Tompson, P Sermanet… - Proceedings of the IEEE/CVF conference on computer …, 2020

被引用次数：117 相关文章所有 8 个版本