Probabilistic representations for video contrastive learning

J Park, J Lee, IJ Kim, K Sohn - Proceedings of the IEEE/CVF …, 2022 - openaccess.thecvf.com
… the advantages of contrastive learning, we propose probabilistic … Probabilistic Representations
for Video Contrastive Learning (ProViCo), in which videos are represented as probability

Spatiotemporal contrastive video representation learning

R Qian, T Meng, B Gong, MH Yang… - Proceedings of the …, 2021 - openaccess.thecvf.com
… If we sample temporally distant clips with smaller probabilities, the contrastive loss (Equation
1) would focus more on the temporally close clips, pulling their features closer and …

Active contrastive learning of audio-visual video representations

S Ma, Z Zeng, D McDuff, Y Song - arXiv preprint arXiv:2009.09805, 2020 - arxiv.org
… has been shown to produce generalizable representations of audio and visual data by …
representations for downstream tasks. In this paper, we propose an active contrastive learning

Expectation-maximization contrastive learning for compact video-and-language representations

P Jin, J Huang, F Liu, X Wu, S Ge… - Advances in neural …, 2022 - proceedings.neurips.cc
… use contrastive learning to preserve video-text semantic relatedness. By maximizing the joint
posterior probability of video and text, we find a semantically related subspace for compact …

Video representation learning by dense predictive coding

T Han, W Xie, A Zisserman - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
… for self-supervised representation learning on videos. This … future representations; Second,
we propose a curriculum training … , contrast, saturation, hue and random greyscale during …

SCVRL: Shuffled contrastive video representation learning

M Dorkenwald, F Xiao, B Brattoli… - Proceedings of the …, 2022 - openaccess.thecvf.com
… We evaluate the effect of our probabilistic targeted sampling on the final performance. We
observe a performance boost with targeted sampling (temperature β = 5) compared to uniform …

Contrastive predictive coding with transformer for video representation learning

Y Liu, J Ma, Y Xie, X Yang, X Tao, L Peng, W Gao - Neurocomputing, 2022 - Elsevier
… Next, the context c is passed to a spatial pooling layer to get a feature vector, and then a
fully-connected layer and a multi-class softmax function outputs the probabilities for video action …

Contrastive representation learning: A framework and review

PH Le-Khac, G Healy, AF Smeaton - IEEE Access, 2020 - ieeexplore.ieee.org
videos with multiple viewpoints as in Time-Contrastive … property of video and applies contrastive
learning on a sequence … to learn a language model using a probabilistic contrastive loss, …

Temporal contrastive pretraining for video action recognition

G Lorre, J Rabarisoa, A Orcesi… - Proceedings of the …, 2020 - openaccess.thecvf.com
video representation learning based on Contrastive Predictive Coding (CPC) [27]. Previously,
CPC has been used to learn representations … and contrastive estimation to learn long-term …

Self-supervised video representation learning with odd-one-out networks

B Fernando, H Bilen, E Gavves… - Proceedings of the …, 2017 - openaccess.thecvf.com
… In contrast, we focus on learning the motion patterns within … Therefore the CNN returns the
conditional probability of action … log probability of the class y given video X is obtained by ∑m …