Z Guo, B Dong, Z Ji, J Bai, Y Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Prompt tuning has been employed as an efficient way to adapt large vision-language pre- trained models (eg CLIP) to various downstream tasks in data-limited or label-limited …
Z Ding, A Wang, H Chen, Q Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Multi-label recognition (MLR) with incomplete labels is very challenging. Recent works strive to explore the image-to-label correspondence in the vision-language model, ie, CLIP, to …
Y Kim, JM Kim, J Jeong, C Schmid… - Proceedings of the …, 2023 - openaccess.thecvf.com
Due to the expensive costs of collecting labels in multi-label classification datasets, partially annotated multi-label classification has become an emerging field in computer vision. One …
P Hu, X Sun, S Sclaroff… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Multi-label image recognition in the low-label regime is a task of great challenge and practical significance. Previous works have focused on learning the alignment between …
Video scene graph generation (VidSGG) aims to identify objects in visual scenes and infer their relationships for a given video. It requires not only a comprehensive understanding of …
Ingredient prediction has received more and more attention with the help of image processing for its diverse real-world applications, such as nutrition intake management and …
S Wang, Q Wan, X Xiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Partially annotated images are easy to obtain in multi-label classification. However, unknown labels in partially annotated images exacerbate the positive-negative imbalance …
X Zhao, Y Shen, S Wang, H Zhang - Pattern Recognition Letters, 2023 - Elsevier
Abstract Generalized Zero-Shot Learning (GZSL) has become an important research due to its powerful ability of recognizing unseen objects. Generative methods, converting …
Z Yuan, K Zhang, T Huang - arXiv preprint arXiv:2306.16016, 2023 - arxiv.org
Multi-label classification (MLC) suffers from the inevitable label noise in training data due to the difficulty in annotating various semantic labels in each image. To mitigate the influence …