Human action recognition from various data modalities: A review

Z Sun, Q Ke, H Rahmani, M Bennamoun… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …

Beyond one-hot encoding: Lower dimensional target embedding

P Rodríguez, MA Bautista, J Gonzalez… - Image and Vision …, 2018 - Elsevier
Target encoding plays a central role when learning Convolutional Neural Networks. In this
realm, one-hot encoding is the most prevalent strategy due to its simplicity. However, this so …

Generating visual explanations

LA Hendricks, Z Akata, M Rohrbach, J Donahue… - Computer Vision–ECCV …, 2016 - Springer
Clearly explaining a rationale for a classification decision to an end user can be as important
as the decision itself. Existing approaches for deep visual recognition are generally opaque …

Learning a discriminative filter bank within a CNN for fine-grained recognition

Y Wang, VI Morariu, LS Davis - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Compared to earlier multistage frameworks using CNN features, recent end-to-end deep
approaches for fine-grained recognition essentially enhance the mid-level learning …

Convolutional neural network-based finger-vein recognition using NIR image sensors

HG Hong, MB Lee, KR Park - Sensors, 2017 - mdpi.com
Conventional finger-vein recognition systems perform recognition based on the finger-vein
lines extracted from the input images or image enhancement, and texture feature extraction …

Training interpretable convolutional neural networks by differentiating class-specific filters

H Liang, Z Ouyang, Y Zeng, H Su, Z He, ST Xia… - Computer Vision–ECCV …, 2020 - Springer
Convolutional neural networks (CNNs) have been successfully used in a range of tasks.
However, CNNs are often viewed as “black-box” and lack of interpretability. One main …

[HTML][HTML] Kernelized supervised laplacian eigenmap for visualization and classification of multi-label data

M Tai, M Kudo, A Tanaka, H Imai, K Kimura - Pattern Recognition, 2022 - Elsevier
We had previously proposed a supervised Laplacian eigenmap for visualization (SLE-ML)
that can handle multi-label data. In addition, SLE-ML can control the trade-off between the …

Learning spatiotemporal features for infrared action recognition with 3d convolutional neural networks

Z Jiang, V Rozgic, S Adali - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
While the action recognition task on videos collected from visible spectrum imaging has
received much attention, action recognition in infrared (IR) videos is significantly less …

A study on multimodal and interactive explanations for visual question answering

K Alipour, JP Schulze, Y Yao, A Ziskind… - arXiv preprint arXiv …, 2020 - arxiv.org
Explainability and interpretability of AI models is an essential factor affecting the safety of AI.
While various explainable AI (XAI) approaches aim at mitigating the lack of transparency in …

Deep residual infrared action recognition by integrating local and global spatio-temporal cues

J Imran, B Raman - Infrared Physics & Technology, 2019 - Elsevier
Human action recognition (HAR) is an important area of research in the field of computer
vision. Though a lot of efforts have been made in the past for HAR in visible spectrum, yet …