3mformer: Multi-order multi-mode transformer for skeletal action recognition

L Wang, P Koniusz - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Many skeletal action recognition models use GCNs to represent the human body by 3D
body joints connected body parts. GCNs aggregate one-or few-hop graph neighbourhoods …

[HTML][HTML] Robust attentional aggregation of deep feature sets for multi-view 3D reconstruction

B Yang, S Wang, A Markham, N Trigoni - International Journal of …, 2020 - Springer
We study the problem of recovering an underlying 3D shape from a set of images. Existing
learning based approaches usually resort to recurrent neural nets, eg, GRU, or intuitive …

Stochastic partial swap: Enhanced model generalization and interpretability for fine-grained recognition

S Huang, X Wang, D Tao - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Learning mid-level representation for fine-grained recognition is easily dominated by a
limited number of highly discriminative patterns, degrading its robustness and generalization …

Tensor representations for action recognition

P Koniusz, L Wang, A Cherian - IEEE Transactions on Pattern …, 2021 - ieeexplore.ieee.org
Human actions in video sequences are characterized by the complex interplay between
spatial features and their temporal dynamics. In this paper, we propose novel tensor …

Power normalizations in fine-grained image, few-shot image and graph classification

P Koniusz, H Zhang - IEEE Transactions on Pattern Analysis …, 2021 - ieeexplore.ieee.org
Power Normalizations (PN) are useful non-linear operators which tackle feature imbalances
in classification problems. We study PNs in the deep learning setup via a novel PN layer …

Learning partial correlation based deep visual representation for image classification

S Rahman, P Koniusz, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual representation based on covariance matrix has demonstrates its efficacy for image
classification by characterising the pairwise correlation of different channels in convolutional …

Learning spatial-context-aware global visual feature representation for instance image retrieval

Z Zhang, L Wang, L Zhou… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In instance image retrieval, considering local spatial information within an image has proven
effective to boost retrieval performance, as demonstrated by local visual descriptor based …

Multi-level second-order few-shot learning

H Zhang, H Li, P Koniusz - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or
unsupervised few-shot image classification and few-shot action recognition. We leverage so …

Deep residual pooling network for texture recognition

S Mao, D Rajan, LT Chia - pattern Recognition, 2021 - Elsevier
Current deep learning-based texture recognition methods extract spatial orderless features
from pre-trained deep learning models that are trained on large-scale image datasets …

Exploiting field dependencies for learning on categorical data

Z Li, P Koniusz, L Zhang, DE Pagendam… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Traditional approaches for learning on categorical data underexploit the dependencies
between columns (aka fields) in a dataset because they rely on the embedding of data …