Object detection using deep learning, CNNs and vision transformers: A review

AB Amjoud, M Amrouch - IEEE Access, 2023 - ieeexplore.ieee.org
Detecting objects remains one of computer vision and image understanding applications'
most fundamental and challenging aspects. Significant advances in object detection have …

Convolutional neural networks or vision transformers: Who will win the race for action recognitions in visual data?

O Moutik, H Sekkat, S Tigani, A Chehri, R Saadane… - Sensors, 2023 - mdpi.com
Understanding actions in videos remains a significant challenge in computer vision, which
has been the subject of several pieces of research in the last decades. Convolutional neural …

Multi-level feature fusion for multimodal human activity recognition in Internet of Healthcare Things

MM Islam, S Nooruddin, F Karray, G Muhammad - Information Fusion, 2023 - Elsevier
Human Activity Recognition (HAR) has become a crucial element for smart
healthcare applications due to the fast adoption of wearable sensors and mobile …

A survey on 3D skeleton-based action recognition using learning method

B Ren, M Liu, R Ding, H Liu - Cyborg and Bionic Systems, 2024 - spj.science.org
Three-dimensional skeleton-based action recognition (3D SAR) has gained important
attention within the computer vision community, owing to the inherent advantages offered by …

A survey on video action recognition in sports: Datasets, methods and applications

F Wu, Q Wang, J Bian, N Ding, F Lu… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
To understand human behaviors, action recognition based on videos is a common
approach. Compared with image-based action recognition, videos provide much more …

Recent advances in LoRa: A comprehensive survey

Z Sun, H Yang, K Liu, Z Yin, Z Li, W Xu - ACM Transactions on Sensor …, 2022 - dl.acm.org
The vast demand for diverse applications raises new networking challenges, which have
encouraged the development of a new paradigm of Internet of Things (IoT), e.g., LoRa. LoRa …

M3Net: multi-view encoding, matching, and fusion for few-shot fine-grained action recognition

H Tang, J Liu, S Yan, R Yan, Z Li, J Tang - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Due to the scarcity of manually annotated data required for fine-grained video
understanding, few-shot fine-grained (FS-FG) action recognition has gained significant …

Efficient video transformers with spatial-temporal token selection

J Wang, X Yang, H Li, L Liu, Z Wu, YG Jiang - European Conference on …, 2022 - Springer
Video transformers have achieved impressive results on major video recognition
benchmarks, which however suffer from high computational cost. In this paper, we present …

Review on human action recognition in smart living: Sensing technology, multimodality, real-time processing, interoperability, and resource-constrained processing

G Diraco, G Rescio, P Siciliano, A Leone - Sensors, 2023 - mdpi.com
Smart living, a concept that has gained increasing attention in recent years, revolves around
integrating advanced technologies in homes and cities to enhance the quality of life for …

Transformer for skeleton-based action recognition: A review of recent advances

W Xin, R Liu, Y Liu, Y Chen, W Yu, Q Miao - Neurocomputing, 2023 - Elsevier
Skeleton-based action recognition has rapidly become one of the most popular and
essential research topics in computer vision. The task is to analyze the characteristics of …