Few-shot object detection: Research advances and challenges

Z Xin, S Chen, T Wu, Y Shao, W Ding, X You - Information Fusion, 2024 - Elsevier
Object detection as a subfield within computer vision has achieved remarkable progress,
which aims to accurately identify and locate a specific object from images or videos. Such …

Transductive few-shot learning with prototype-based label propagation by iterative graph refinement

H Zhu, P Koniusz - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Few-shot learning (FSL) is popular due to its ability to adapt to novel classes. Compared
with inductive few-shot learning, transductive models typically perform better as they …

3mformer: Multi-order multi-mode transformer for skeletal action recognition

L Wang, P Koniusz - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Many skeletal action recognition models use GCNs to represent the human body by 3D
body joints connected body parts. GCNs aggregate one-or few-hop graph neighbourhoods …

Distilling self-supervised vision transformers for weakly-supervised few-shot classification & segmentation

D Kang, P Koniusz, M Cho… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We address the task of weakly-supervised few-shot image classification and segmentation,
by leveraging a Vision Transformer (ViT) pretrained with self-supervision. Our proposed …

A survey of deep learning for low-shot object detection

Q Huang, H Zhang, M Xue, J Song, M Song - ACM Computing Surveys, 2023 - dl.acm.org
Object detection has achieved a huge breakthrough with deep neural networks and massive
annotated data. However, current detection methods cannot be directly transferred to the …

Fs-detr: Few-shot detection transformer with prompting and without re-training

A Bulat, R Guerrero, B Martinez… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper is on Few-Shot Object Detection (FSOD), where given a few templates
(examples) depicting a novel class (not seen during training), the goal is to detect all of its …

Temporal-viewpoint transportation plan for skeletal few-shot action recognition

L Wang, P Koniusz - … of the Asian Conference on Computer …, 2022 - openaccess.thecvf.com
We propose a Few-shot Learning pipeline for 3D skeleton-based action recognition by Joint
Temporal and Camera Viewpoint Alignment. To factor out misalignment between query and …

Learning partial correlation based deep visual representation for image classification

S Rahman, P Koniusz, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Visual representation based on covariance matrix has demonstrates its efficacy for image
classification by characterising the pairwise correlation of different channels in convolutional …

Learning spatial-context-aware global visual feature representation for instance image retrieval

Z Zhang, L Wang, L Zhou… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In instance image retrieval, considering local spatial information within an image has proven
effective to boost retrieval performance, as demonstrated by local visual descriptor based …

Exploiting field dependencies for learning on categorical data

Z Li, P Koniusz, L Zhang, DE Pagendam… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Traditional approaches for learning on categorical data underexploit the dependencies
between columns (aka fields) in a dataset because they rely on the embedding of data …