相关文章- 学术资源搜索

Shuffle and learn: unsupervised learning using temporal order verification

I Misra, CL Zitnick, M Hebert - … , The Netherlands, October 11–14, 2016 …, 2016 - Springer

In this paper, we present an approach for learning a visual representation from the raw
spatiotemporal signals in videos. Our representation is learned without supervision from …

被引用次数：1006 相关文章所有 4 个版本

[PDF] arxiv.org

Video representation learning by recognizing temporal transformations

S Jenni, G Meishvili, P Favaro - European conference on computer vision, 2020 - Springer

We introduce a novel self-supervised learning approach to learn representations of videos
that are responsive to changes in the motion dynamics. Our representations can be learned …

被引用次数：135 相关文章所有 7 个版本

[PDF] thecvf.com

Unsupervised representation learning by sorting sequences

HY Lee, JB Huang, M Singh… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We present an unsupervised representation learning approach using videos without
semantic labels. We leverage the temporal coherence as a supervisory signal by formulating …

被引用次数：631 相关文章所有 13 个版本

[PDF] arxiv.org

Video representation learning with visual tempo consistency

C Yang, Y Xu, B Dai, B Zhou - arXiv preprint arXiv:2006.15489, 2020 - arxiv.org

Visual tempo, which describes how fast an action goes, has shown its potential in
supervised action recognition. In this work, we demonstrate that visual tempo can also serve …

被引用次数：93 相关文章所有 2 个版本

[PDF] mlr.press

Learning end-to-end video classification with rank-pooling

B Fernando, S Gould - International Conference on Machine …, 2016 - proceedings.mlr.press

We introduce a new model for representation learning and classification of video
sequences. Our model is based on a convolutional neural network coupled with a novel …

被引用次数：106 相关文章所有 15 个版本

[PDF] thecvf.com

Self-supervised spatiotemporal learning via video clip order prediction

D Xu, J Xiao, Z Zhao, J Shao, D Xie… - Proceedings of the …, 2019 - openaccess.thecvf.com

We propose a self-supervised spatiotemporal learning technique which leverages the
chronological order of videos. Our method can learn the spatiotemporal representation of …

被引用次数：500 相关文章所有 9 个版本

[PDF] mlr.press

Unsupervised learning of video representations using lstms

N Srivastava, E Mansimov… - … on machine learning, 2015 - proceedings.mlr.press

Abstract We use Long Short Term Memory (LSTM) networks to learn representations of
video sequences. Our model uses an encoder LSTM to map an input sequence into a fixed …

被引用次数：3189 相关文章所有 19 个版本

[PDF] thecvf.com

Only time can tell: Discovering temporal data for temporal modeling

L Sevilla-Lara, S Zha, Z Yan… - Proceedings of the …, 2021 - openaccess.thecvf.com

Understanding temporal information and how the visual world changes over time, is a
fundamental ability of intelligent systems. In video understanding, temporal information is at …

被引用次数：80 相关文章所有 7 个版本

[PDF] thecvf.com

Temporal contrastive pretraining for video action recognition

G Lorre, J Rabarisoa, A Orcesi… - Proceedings of the …, 2020 - openaccess.thecvf.com

In this paper, we propose a self-supervised method for video representation learning based
on Contrastive Predictive Coding (CPC)[27]. Previously, CPC has been used to learn …

被引用次数：48 相关文章所有 6 个版本

[PDF] thecvf.com

Deep temporal linear encoding networks

A Diba, V Sharma, L Van Gool - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

The CNN-encoding of features from entire videos for the representation of human actions
has rarely been addressed. Instead, CNN work has focused on approaches to fuse spatial …

被引用次数：299 相关文章所有 12 个版本

高级搜索

QQ 群

Shuffle and learn: unsupervised learning using temporal order verification

Video representation learning by recognizing temporal transformations

Unsupervised representation learning by sorting sequences

Video representation learning with visual tempo consistency

Learning end-to-end video classification with rank-pooling

Self-supervised spatiotemporal learning via video clip order prediction

Unsupervised learning of video representations using lstms

Only time can tell: Discovering temporal data for temporal modeling

Temporal contrastive pretraining for video action recognition

Deep temporal linear encoding networks

相关搜索

引用