Video transformers: A survey

J Selva, AS Johansen, S Escalera… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer models have shown great success handling long-range interactions, making
them a promising tool for modeling video. However, they lack inductive biases and scale …

Dance with self-attention: A new look of conditional random fields on anomaly detection in videos

D Purwanto, YT Chen, WH Fang - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
This paper proposes a novel weakly supervised approach for anomaly detection, which
begins with a relation-aware feature extractor to capture the multi-scale convolutional neural …

Asymmetric masked distillation for pre-training small foundation models

Z Zhao, B Huang, S Xing, G Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Self-supervised foundation models have shown great potential in computer vision thanks to
the pre-training paradigm of masked autoencoding. Scale is a primary factor influencing the …

Deep learning for heterogeneous human activity recognition in complex iot applications

M Abdel-Basset, H Hawash, V Chang… - IEEE Internet of …, 2020 - ieeexplore.ieee.org
With continued improvements in wireless sensing technology, the notion of the Internet of
Things (IoT) has been widely adopted and has become pervasive owing to its broad …

Efficient action recognition via dynamic knowledge propagation

H Kim, M Jain, JT Lee, S Yun… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Efficient action recognition has become crucial to extend the success of action recognition to
many real-world applications. Contrary to most existing methods, which mainly focus on …

EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding

Y Xu, YL Li, Z Huang, MX Liu, C Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
With the surge in attention to Egocentric Hand-Object Interaction (Ego-HOI), large-scale
datasets such as Ego4D and EPIC-KITCHENS have been proposed. However, most current …

Relational reasoning for group activity recognition via self-attention augmented conditional random field

RRA Pramono, WH Fang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This paper presents a new relational network for group activity recognition. The essence of
the network is to integrate conditional random fields (CRFs) with self-attention to infer the …

Joint feature optimization and fusion for compressed action recognition

H Li, X Jiang, B Guan, RRM Tan… - … on Image Processing, 2021 - ieeexplore.ieee.org
Recent methods including CoViAR and DMC-Net provide a new paradigm for action
recognition since they are directly targeted at compressed videos (eg, MPEG4 files). It …

A teacher–student deep learning strategy for extreme low resolution unsafe action recognition in construction projects

M Yang, C Wu, Y Guo, Y He, R Jiang, J Jiang… - Advanced Engineering …, 2024 - Elsevier
A large proportion of construction accidents are caused by workers' unsafe actions. Due to
the complexity of the work environment and excessive demands of safety supervision on …

Human activity recognition and fall detection using convolutional neural network and transformer-based architecture

MAA Al-qaness, A Dahou, M Abd Elaziz… - … Signal Processing and …, 2024 - Elsevier
Abstract Human Activity Recognition (HAR) and fall detection, as applications within the field
of biomedical signal processing, are increasingly pivotal in enhancing patient care …