A comparison of pooling methods for convolutional neural networks

A Zafar, M Aamir, N Mohd Nawi, A Arshad, S Riaz… - Applied Sciences, 2022 - mdpi.com
One of the most promising techniques used in various sciences is deep neural networks
(DNNs). A special type of DNN called a convolutional neural network (CNN) consists of …

Learning partial correlation based deep visual representation for image classification

S Rahman, P Koniusz, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual representation based on covariance matrix has demonstrates its efficacy for image
classification by characterising the pairwise correlation of different channels in convolutional …

Convolutional fine-grained classification with self-supervised target relation regularization

K Liu, K Chen, K Jia - IEEE Transactions on Image Processing, 2022 - ieeexplore.ieee.org
Fine-grained visual classification can be addressed by deep representation learning under
supervision of manually pre-defined targets (eg, one-hot or the Hadamard codes). Such …

Crossformer: Cross spatio-temporal transformer for 3d human pose estimation

M Hassanin, A Khamiss, M Bennamoun… - arXiv preprint arXiv …, 2022 - arxiv.org
3D human pose estimation can be handled by encoding the geometric dependencies
between the body parts and enforcing the kinematic constraints. Recently, Transformer has …

Fine-grained image classification via multi-scale selective hierarchical biquadratic pooling

M Tan, F Yuan, J Yu, G Wang, X Gu - ACM Transactions on Multimedia …, 2022 - dl.acm.org
How to extract distinctive features greatly challenges the fine-grained image classification
tasks. In previous models, bilinear pooling has been frequently adopted to address this …

Bi-STAN: bilinear spatial-temporal attention network for wearable human activity recognition

C Gao, Y Chen, X Jiang, L Hu, Z Zhao… - International Journal of …, 2023 - Springer
With the progressive development of ubiquitous computing, wearable human activity
recognition is playing an increasingly important role in many fields, such as health …

Efficient compact bilinear pooling via kronecker product

T Yu, Y Cai, P Li - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Bilinear pooling has achieved excellent performance in fine-grained recognition tasks.
Nevertheless, high-dimensional bilinear features suffer from over-fitting and inefficiency. To …

3D object representation learning: A set-to-set matching perspective

T Yu, J Meng, M Yang, J Yuan - IEEE Transactions on Image …, 2021 - ieeexplore.ieee.org
In this paper, we tackle the 3D object representation learning from the perspective of set-to-
set matching. Given two 3D objects, calculating their similarity is formulated as the problem …

Improved Bilinear Pooling With Pseudo Square-Rooted Matrix

S Xu, D Muselet, A Trémeau… - IEEE Signal Processing …, 2023 - ieeexplore.ieee.org
Bilinear pooling is a feature aggregation step applied after the convolutional layers of a
deep network and encodes a matrix of local features into a fixed-size bilinear representation …

GBP: Graph convolutional network embedded in bilinear pooling for fine-grained encoding

Y Du, J Tang, T Rui, X Li, C Yang - Computers and Electrical Engineering, 2024 - Elsevier
In fine-grained recognition, classical high-order coding has inherent contradiction between
visual burstiness and feature redundancy, the core of which is the inherent instability of high …