A simple baseline for open-vocabulary semantic segmentation with pre-trained vision-language model

M Xu, Z Zhang, F Wei, Y Lin, Y Cao, H Hu… - European Conference on …, 2022 - Springer
Recently, open-vocabulary image classification by vision language pre-training has
demonstrated incredible achievements, that the model can classify arbitrary categories …

Kernelized few-shot object detection with efficient integral aggregation

S Zhang, L Wang, N Murray… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We design a Kernelized Few-shot Object Detector by leveraging kernelized
matrices computed over multiple proposal regions, which yield expressive non-linear …

Time-reversed diffusion tensor transformer: A new tenet of few-shot object detection

S Zhang, N Murray, L Wang, P Koniusz - European Conference on …, 2022 - Springer
In this paper, we tackle the challenging problem of Few-shot Object Detection. Existing
FSOD pipelines (i) use average-pooled representations that result in information loss; and/or …

Learning partial correlation based deep visual representation for image classification

S Rahman, P Koniusz, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual representation based on covariance matrix has demonstrates its efficacy for image
classification by characterising the pairwise correlation of different channels in convolutional …

Multi-level second-order few-shot learning

H Zhang, H Li, P Koniusz - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
We propose a Multi-level Second-order (MlSo) few-shot learning network for supervised or
unsupervised few-shot image classification and few-shot action recognition. We leverage so …

Convolutional fine-grained classification with self-supervised target relation regularization

K Liu, K Chen, K Jia - IEEE Transactions on Image Processing, 2022 - ieeexplore.ieee.org
Fine-grained visual classification can be addressed by deep representation learning under
supervision of manually pre-defined targets (eg, one-hot or the Hadamard codes). Such …

A Lie algebra representation for efficient 2D shape classification

X Yu, Y Gao, M Bennamoun, S Xiong - Pattern Recognition, 2023 - Elsevier
Riemannian manifold plays a vital role as a powerful mathematical tool in computer vision,
with important applications in curved shape analysis and classification. Significant progress …

Dropcov: A simple yet effective method for improving deep architectures

Q Wang, M Gao, Z Zhang, J Xie… - Advances in Neural …, 2022 - proceedings.neurips.cc
Previous works show global covariance pooling (GCP) has great potential to improve deep
architectures especially on visual recognition tasks, where post-normalization of GCP plays …

Towards a Deeper Understanding of Global Covariance Pooling in Deep Learning: An Optimization Perspective

Q Wang, Z Zhang, M Gao, J Xie, P Zhu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Global covariance pooling (GCP) as an effective alternative to global average pooling has
shown good capacity to improve deep convolutional neural networks (CNNs) in a variety of …

Efficient compact bilinear pooling via kronecker product

T Yu, Y Cai, P Li - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Bilinear pooling has achieved excellent performance in fine-grained recognition tasks.
Nevertheless, high-dimensional bilinear features suffer from over-fitting and inefficiency. To …