Two-level attention module based on spurious-3d residual networks for human action recognition

B Chen, F Meng, H Tang, G Tong - Sensors, 2023 - mdpi.com
In recent years, deep learning techniques have excelled in video action recognition.
However, currently commonly used video action recognition models minimize the …

GPT4Ego: unleashing the potential of pre-trained models for zero-shot egocentric action recognition

G Dai, X Shu, W Wu, R Yan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Vision-Language Models (VLMs), pre-trained on large-scale datasets, have shown
impressive performance in various visual recognition tasks. This advancement paves the …

Sequential order-aware coding-based robust subspace clustering for human action recognition in untrimmed videos

Z Zhou, C Ding, J Li, E Mohammadi… - … on Image Processing, 2022 - ieeexplore.ieee.org
Human action recognition (HAR) is one of most important tasks in video analysis. Since
video clips distributed on networks are usually untrimmed, it is required to accurately …

Human activity recognition using self-powered sensors based on multilayer bidirectional long short-term memory networks

J Su, Z Liao, Z Sheng, AX Liu, D Singh… - IEEE Sensors …, 2022 - ieeexplore.ieee.org
Sensor-based human activity recognition (HAR) requires the acquisition of channel state
information (CSI) data with time series based on sensors to predict human behavior. Many …

An individualized system of skeletal data-based CNN classifiers for action recognition in manufacturing assembly

M Al-Amin, R Qin, M Moniruzzaman, Z Yin… - Journal of Intelligent …, 2023 - Springer
Abstract Real-time Action Recognition (ActRgn) of assembly workers can timely assist
manufacturers in correcting human mistakes and improving task performance. Yet …

Global memory and local continuity for video object detection

L Han, Z Yin - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
To deal with the challenges in video object detection (VOD), such as occlusion and motion
blur, many state-of-the-art video object detectors adopt a feature aggregation module to …

Video is graph: Structured graph module for video action recognition

R Li, XJ Wu, T Xu - arXiv preprint arXiv:2110.05904, 2021 - arxiv.org
In the field of action recognition, video clips are always treated as ordered frames for
subsequent processing. To achieve spatio-temporal perception, existing approaches …

Temporal action segmentation with high-level complex activity labels

G Ding, A Yao - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
The temporal action segmentation task segments videos temporally and predicts action
labels for all frames. Fully supervising such a segmentation model requires dense frame …

Distortion map-guided feature rectification for efficient video semantic segmentation

J Xiong, LM Po, WY Yu, Y Zhao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
To leverage the strong cross-frame relations of videos, many video semantic segmentation
methods tend to explore feature reuse and feature warping based on motion clues …

Feature weakening, contextualization, and discrimination for weakly supervised temporal action localization

M Moniruzzaman, Z Yin - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Weakly-supervised Temporal Action Localization (W-TAL) aims to train a model to localize
all action instances potentially from different classes in an untrimmed video, using a training …