Graphadapter: Tuning vision-language models with dual knowledge graph

X Li, D Lian, Z Lu, J Bai, Z Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Adapter-style efficient transfer learning (ETL) has shown excellent performance in the tuning
of vision-language models (VLMs) under the low-data regime, where only a few additional …

Task residual for tuning vision-language models

T Yu, Z Lu, X Jin, Z Chen… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Large-scale vision-language models (VLMs) pre-trained on billion-level data have learned
general visual representations and broad visual concepts. In principle, the well-learned …

Dynamic correlation learning and regularization for multi-label confidence calibration

T Chen, W Wang, T Pu, J Qin, Z Yang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Modern visual recognition models often display overconfidence due to their reliance on
complex deep neural networks and one-hot target supervision, resulting in unreliable …

[HTML][HTML] Frozen is better than learning: A new design of prototype-based classifier for semantic segmentation

J Chen, D Deguchi, C Zhang, X Zheng, H Murase - Pattern Recognition, 2024 - Elsevier
Semantic segmentation models comprise an encoder to extract features and a classifier for
prediction. However, the learning of the classifier suffers from the ambiguity which is caused …

Ecea: Extensible co-existing attention for few-shot object detection

Z Xin, T Wu, S Chen, Y Zou, L Shao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Few-shot object detection (FSOD) identifies objects from extremely few annotated samples.
Most existing FSOD methods, recently, apply the two-stage learning paradigm, which …

Beyond sole strength: Customized ensembles for generalized vision-language models

Z Lu, J Bai, X Li, Z Xiao, X Wang - arXiv preprint arXiv:2311.17091, 2023 - arxiv.org
Fine-tuning pre-trained vision-language models (VLMs), eg, CLIP, for the open-world
generalization has gained increasing popularity due to its practical value. However …

Zero-shot sketch-based image retrieval via adaptive relation-aware metric learning

Y Liu, Y Dang, X Gao, J Han, L Shao - Pattern Recognition, 2024 - Elsevier
Retrieving natural images with the query sketches under the zero-shot scenario is known as
zero-shot sketch-based image retrieval (ZS-SBIR). Most of the best-performing methods …

ENInst: Enhancing weakly-supervised low-shot instance segmentation

M Ye-Bin, D Choi, Y Kwon, J Kim, TH Oh - Pattern Recognition, 2024 - Elsevier
We address a weakly-supervised low-shot instance segmentation, an annotation-efficient
training method to deal with novel classes effectively. Since it is an under-explored problem …

Rethinking Prior Information Generation with CLIP for Few-Shot Segmentation

J Wang, B Zhang, J Pang, H Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
Few-shot segmentation remains challenging due to the limitations of its labeling information
for unseen classes. Most previous approaches rely on extracting high-level feature maps …

Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework

Z Li, F Lu, J Zou, L Hu, H Zhang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Land-cover mapping is one of the vital applications in Earth observation aiming at
classifying each pixel's land-cover type of remote-sensing images. As natural and human …