[HTML][HTML] A review on the long short-term memory model

G Van Houdt, C Mosquera, G Nápoles - Artificial Intelligence Review, 2020 - Springer
Long short-term memory (LSTM) has transformed both machine learning and
neurocomputing fields. According to several online sources, this model has improved …

[HTML][HTML] Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges

HF Nweke, YW Teh, MA Al-Garadi, UR Alo - Expert Systems with …, 2018 - Elsevier
Human activity recognition systems are developed as part of a framework to enable
continuous monitoring of human behaviours in the area of ambient assisted living, sports …

Anticipative video transformer

R Girdhar, K Grauman - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Abstract We propose Anticipative Video Transformer (AVT), an end-to-end attention-based
video modeling architecture that attends to the previously observed video in order to …

End-to-end temporal action detection with transformer

X Liu, Q Wang, Y Hu, X Tang, S Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …

[HTML][HTML] Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

Rethinking the faster r-cnn architecture for temporal action localization

YW Chao, S Vijayanarasimhan… - Proceedings of the …, 2018 - openaccess.thecvf.com
We propose TAL-Net, an improved approach to temporal action localization in video that is
inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key …

Peeking into the future: Predicting future person activities and locations in videos

J Liang, L Jiang, JC Niebles… - Proceedings of the …, 2019 - openaccess.thecvf.com
Deciphering human behaviors to predict their future paths/trajectories and what they would
do from videos is important in many applications. Motivated by this idea, this paper studies …

Eidetic 3D LSTM: A model for video prediction and beyond

Y Wang, L Jiang, MH Yang, LJ Li, M Long… - International …, 2018 - openreview.net
Spatiotemporal predictive learning, though long considered to be a promising self-
supervised feature learning method, seldom shows its effectiveness beyond future video …

Tall: Temporal activity localization via language query

J Gao, C Sun, Z Yang… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
This paper focuses on temporal localization of actions from untrimmed videos. Existing
methods typically involve training classifiers for a pre-defined list of actions and applying the …

R-c3d: Region convolutional 3d network for temporal activity detection

H Xu, A Das, K Saenko - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We address the problem of activity detection in continuous, untrimmed video streams. This is
a difficult task that requires extracting meaningful spatio-temporal features to capture …