Contextvp: Fully context-aware video prediction

W Byeon, Q Wang, RK Srivastava… - Proceedings of the …, 2018 - openaccess.thecvf.com
Video prediction models based on convolutional networks, recurrent networks, and their
combinations often result in blurry predictions. We identify an important contributing factor for …

Imaginator: Conditional spatio-temporal gan for video generation

Y Wang, P Bilinski, F Bremond… - Proceedings of the …, 2020 - openaccess.thecvf.com
Generating human videos based on single images entails the challenging simultaneous
generation of realistic and visual appealing appearance and motion. In this context, we …

S3vae: Self-supervised sequential vae for representation disentanglement and data generation

Y Zhu, MR Min, A Kadav… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
We propose a sequential variational autoencoder to learn disentangled representations of
sequential data (eg, videos and audios) under self-supervision. Specifically, we exploit the …

Dnerv: Modeling inherent dynamics via difference neural representation for videos

Q Zhao, MS Asif, Z Ma - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Existing implicit neural representation (INR) methods do not fully exploit spatiotemporal
redundancies in videos. Index-based INRs ignore the content-specific spatial features and …

Convtransformer: A convolutional transformer network for video frame synthesis

Z Liu, S Luo, W Li, J Lu, Y Wu, S Sun, C Li… - arXiv preprint arXiv …, 2020 - arxiv.org
Deep Convolutional Neural Networks (CNNs) are powerful models that have achieved
excellent performance on difficult computer vision tasks. Although CNNs perform well …

Flow-grounded spatial-temporal video prediction from still images

Y Li, C Fang, J Yang, Z Wang, X Lu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Existing video prediction methods mainly rely on observing multiple historical frames or
focus on predicting the next one-frame. In this work, we study the problem of generating …

[PDF][PDF] Videoflow: A flow-based generative model for video

M Kumar, M Babaeizadeh, D Erhan… - arXiv preprint arXiv …, 2019 - researchgate.net
Generative models that can model and predict sequences of future events can, in principle,
learn to capture complex real-world phenomena, such as physical interactions. In particular …

[PDF][PDF] Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies.

W Liu, W Luo, Z Li, P Zhao, S Gao - IJCAI, 2019 - ijcai.org
Classical semi-supervised video anomaly detection assumes that only normal data are
available in the training set because of the rare and unbounded nature of anomalies. It is …

Pre-training contextualized world models with in-the-wild videos for reinforcement learning

J Wu, H Ma, C Deng, M Long - Advances in Neural …, 2024 - proceedings.neurips.cc
Unsupervised pre-training methods utilizing large and diverse datasets have achieved
tremendous success across a range of domains. Recent work has investigated such …

Hierarchical long-term video prediction without supervision

R Villegas, D Erhan, H Lee - International Conference on …, 2018 - proceedings.mlr.press
Much of recent research has been devoted to video prediction and generation, yet most of
the previous works have demonstrated only limited success in generating videos on short …