A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

Folded recurrent neural networks for future video prediction

M Oliu, J Selva, S Escalera - Proceedings of the European …, 2018 - openaccess.thecvf.com
This work introduces double-mapping Gated Recurrent Units (dGRU), an extension of
standard GRUs where the input is considered as a recurrent state. An extra set of logic gates …

Decomposing motion and content for natural video sequence prediction

R Villegas, J Yang, S Hong, X Lin, H Lee - arXiv preprint arXiv:1706.08033, 2017 - arxiv.org
We propose a deep neural network for the prediction of future frames in natural video
sequences. To effectively handle complex evolution of pixels in videos, we propose to …

Extdm: Distribution extrapolation diffusion model for video prediction

Z Zhang, J Hu, W Cheng, D Paudel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video prediction is a challenging task due to its nature of uncertainty especially for
forecasting a long period. To model the temporal dynamics advanced methods benefit from …

Video processing using deep learning techniques: A systematic literature review

V Sharma, M Gupta, A Kumar, D Mishra - IEEE Access, 2021 - ieeexplore.ieee.org
Studies show lots of advanced research on various data types such as image, speech, and
text using deep learning techniques, but nowadays, research on video processing is also an …

Fitvid: Overfitting in pixel-level video prediction

M Babaeizadeh, MT Saffar, S Nair, S Levine… - arXiv preprint arXiv …, 2021 - arxiv.org
An agent that is capable of predicting what happens next can perform a variety of tasks
through planning with no additional training. Furthermore, such an agent can internally …

Transformation-based models of video sequences

J Van Amersfoort, A Kannan, MA Ranzato… - arXiv preprint arXiv …, 2017 - arxiv.org
In this work we propose a simple unsupervised approach for next frame prediction in video.
Instead of directly predicting the pixels in a frame given past frames, we predict the …

Flexible spatio-temporal networks for video prediction

C Lu, M Hirsch, B Scholkopf - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We describe a modular framework for video frame prediction. We refer to it as a Flexible
Spatio-Temporal Network (FSTN) as it allows the extrapolation of a video sequence as well …

Hierarchical long-term video prediction without supervision

R Villegas, D Erhan, H Lee - International Conference on …, 2018 - proceedings.mlr.press
Much of recent research has been devoted to video prediction and generation, yet most of
the previous works have demonstrated only limited success in generating videos on short …