ZM Chen, XS Wei, P Wang… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label …
K Zhu, J Wu - Proceedings of the IEEE/CVF international …, 2021 - openaccess.thecvf.com
Multi-label image recognition is a challenging computer vision task of practical use. Progresses in this area, however, are often characterized by complicated methods, heavy …
J Ye, J He, X Peng, W Wu, Y Qiao - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Recent studies often exploit Graph Convolutional Network (GCN) to model label dependencies to improve recognition accuracy for multi-label image recognition. However …
T Chen, M Xu, X Hui, H Wu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Recognizing multiple labels of images is a practical and challenging task, and significant progress has been made by searching semantic-aware regions and modeling label …
T Chen, L Lin, R Chen, X Hui… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label …
The simultaneous recognition of multiple objects in one image remains a challenging task, spanning multiple events in the recognition field such as various object scales, inconsistent …
G Cheng, P Lai, D Gao, J Han - Science China Information Sciences, 2023 - Springer
Visual attention has become a popular and widely used component for image recognition. Although various attention-based methods have been proposed and achieved relatively …
Z Wang, T Chen, G Li, R Xu… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
This paper proposes a novel deep architecture to address multi-label image recognition, a fundamental and practical task towards general visual understanding. Current solutions for …
Z Guo, B Dong, Z Ji, J Bai, Y Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Prompt tuning has been employed as an efficient way to adapt large vision-language pre- trained models (eg CLIP) to various downstream tasks in data-limited or label-limited …