A survey on video-based human action recognition: recent updates, datasets, challenges, and applications

P Pareek, A Thakkar - Artificial Intelligence Review, 2021 - Springer
Abstract Human Action Recognition (HAR) involves human activity monitoring task in
different areas of medical, education, entertainment, visual surveillance, video retrieval, as …

Going deeper into action recognition: A survey

S Herath, M Harandi, F Porikli - Image and vision computing, 2017 - Elsevier
Understanding human actions in visual data is tied to advances in complementary research
areas including object recognition, human dynamics, domain adaptation and semantic …

[PDF][PDF] 智能视频监控技术综述

黄凯奇, 陈晓棠, 康运锋, 谭铁牛 - 计算机学报, 2015 - cjc.ict.ac.cn
摘要随着摄像头安装数量的日益增多, 以及智慧城市和公共安全需求的日益增长,
采用人工的视频监控方式已经远远不能满足需要, 因此智能视频监控技术应运而生并迅速成为 …

X-pool: Cross-modal language-video attention for text-video retrieval

SK Gorti, N Vouitsis, J Ma, K Golestan… - Proceedings of the …, 2022 - openaccess.thecvf.com
In text-video retrieval, the objective is to learn a cross-modal similarity function between a
text and a video that ranks relevant text-video pairs higher than irrelevant pairs. However …

Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer
Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

Human action recognition using attention based LSTM network with dilated CNN features

K Muhammad, A Ullah, AS Imran, M Sajjad… - Future Generation …, 2021 - Elsevier
Human action recognition in videos is an active area of research in computer vision and
pattern recognition. Nowadays, artificial intelligence (AI) based systems are needed for …

Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

Action recognition in video sequences using deep bi-directional LSTM with CNN features

A Ullah, J Ahmad, K Muhammad, M Sajjad… - IEEE …, 2017 - ieeexplore.ieee.org
Recurrent neural network (RNN) and long short-term memory (LSTM) have achieved great
success in processing sequential multimedia data and yielded the state-of-the-art results in …

Hollywood in homes: Crowdsourcing data collection for activity understanding

GA Sigurdsson, G Varol, X Wang, A Farhadi… - Computer Vision–ECCV …, 2016 - Springer
Computer vision has a great potential to help our daily lives by searching for lost keys,
watering flowers or reminding us to take a pill. To succeed with such tasks, computer vision …

Large-scale video classification with convolutional neural networks

A Karpathy, G Toderici, S Shetty, T Leung… - Proceedings of the …, 2014 - cv-foundation.org
Abstract Convolutional Neural Networks (CNNs) have been established as a powerful class
of models for image recognition problems. Encouraged by these results, we provide an …