A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions

SK Yadav, K Tiwari, HM Pandey, SA Akbar - Knowledge-Based Systems, 2021 - Elsevier
Human activity recognition (HAR) is one of the most important and challenging problems in
the computer vision. It has critical application in wide variety of tasks including gaming …

A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

Earthformer: Exploring space-time transformers for earth system forecasting

Z Gao, X Shi, H Wang, Y Zhu… - Advances in …, 2022 - proceedings.neurips.cc
Conventionally, Earth system (eg, weather and climate) forecasting relies on numerical
simulation with complex physical models and hence is both expensive in computation and …

Predrnn: A recurrent neural network for spatiotemporal predictive learning

Y Wang, H Wu, J Zhang, Z Gao, J Wang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
The predictive learning of spatiotemporal sequences aims to generate future images by
learning from the historical context, where the visual dynamics are believed to have modular …

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

Latency-aware collaborative perception

Z Lei, S Ren, Y Hu, W Zhang, S Chen - European Conference on …, 2022 - Springer
Collaborative perception has recently shown great potential to improve perception
capabilities over single-agent perception. Existing collaborative perception methods usually …

A hybrid-convolution spatial–temporal recurrent network for traffic flow prediction

X Zhang, S Wen, L Yan, J Feng, Y Xia - The Computer Journal, 2024 - academic.oup.com
Accurate traffic flow prediction is valuable for satisfying citizens' travel needs and alleviating
urban traffic pressure. However, it is highly challenging due to the complexity of the urban …

Temporal attention unit: Towards efficient spatiotemporal predictive learning

C Tan, Z Gao, L Wu, Y Xu, J Xia… - Proceedings of the …, 2023 - openaccess.thecvf.com
Spatiotemporal predictive learning aims to generate future frames by learning from historical
frames. In this paper, we investigate existing methods and present a general framework of …

Disentangling physical dynamics from unknown factors for unsupervised video prediction

VL Guen, N Thome - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
Leveraging physical knowledge described by partial differential equations (PDEs) is an
appealing way to improve unsupervised video forecasting models. Since physics is too …