A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

Predrnn: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

Self-attention convlstm for spatiotemporal prediction

Z Lin, M Li, Z Zheng, Y Cheng, C Yuan - Proceedings of the AAAI …, 2020 - ojs.aaai.org
Spatiotemporal prediction is challenging due to the complex dynamic motion and
appearance changes. Existing work concentrates on embedding additional cells into the …

Pie: A large-scale dataset and models for pedestrian intention estimation and trajectory prediction

A Rasouli, I Kotseruba, T Kunic… - Proceedings of the …, 2019 - openaccess.thecvf.com
Pedestrian behavior anticipation is a key challenge in the design of assistive and
autonomous driving systems suitable for urban environments. An intelligent system should …

A survey on generative ai and llm for video generation, understanding, and streaming

P Zhou, L Wang, Z Liu, Y Hao, P Hui, S Tarkoma… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper offers an insightful examination of how currently top-trending AI technologies, ie,
generative artificial intelligence (Generative AI) and large language models (LLMs), are …

Latency-aware collaborative perception

Z Lei, S Ren, Y Hu, W Zhang, S Chen - European Conference on …, 2022 - Springer
Collaborative perception has recently shown great potential to improve perception
capabilities over single-agent perception. Existing collaborative perception methods usually …

Disentangling physical dynamics from unknown factors for unsupervised video prediction

VL Guen, N Thome - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
Leveraging physical knowledge described by partial differential equations (PDEs) is an
appealing way to improve unsupervised video forecasting models. Since physics is too …

Eidetic 3D LSTM: A model for video prediction and beyond

Y Wang, L Jiang, MH Yang, LJ Li, M Long… - International …, 2018 - openreview.net
Spatiotemporal predictive learning, though long considered to be a promising self-
supervised feature learning method, seldom shows its effectiveness beyond future video …

Memory in memory: A predictive neural network for learning higher-order non-stationarity from spatiotemporal dynamics

Y Wang, J Zhang, H Zhu, M Long… - Proceedings of the …, 2019 - openaccess.thecvf.com
Natural spatiotemporal processes can be highly non-stationary in many ways, eg the low-
level non-stationarity such as spatial correlations or temporal dependencies of local pixel …