A survey on graph neural networks and graph transformers in computer vision: a task-oriented perspective

C Chen, Y Wu, Q Dai, HY Zhou, M Xu, S Yang… - arXiv preprint arXiv …, 2022 - arxiv.org
Graph Neural Networks (GNNs) have gained momentum in graph representation learning
and boosted the state of the art in a variety of areas, such as data mining (\emph {eg,} social …

Infrared small and dim target detection with transformer under complex backgrounds

F Liu, C Gao, F Chen, D Meng, W Zuo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The infrared small and dim (S&D) target detection is one of the key techniques in the infrared
search and tracking system. Since the local regions similar to infrared S&D targets spread …

Label2label: A language modeling framework for multi-attribute learning

W Li, Z Cao, J Feng, J Zhou, J Lu - European Conference on Computer …, 2022 - Springer
Abstract Objects are usually associated with multiple attributes, and these attributes often
exhibit high correlations. Modeling complex relationships between attributes poses a great …

Ingredient-guided region discovery and relationship modeling for food category-ingredient prediction

Z Wang, W Min, Z Li, L Kang, X Wei… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recognizing the category and its ingredient composition from food images facilitates
automatic nutrition estimation, which is crucial to various health relevant applications, such …

Recent progress in image denoising: A training strategy perspective

W Wu, M Chen, Y Xiang, Y Zhang… - IET Image …, 2023 - Wiley Online Library
Image denoising is one of the hottest topics in image restoration area, it has achieved great
progress both in terms of quantity and quality in recent years, especially after the wide and …

GNDAN: Graph navigated dual attention network for zero-shot learning

S Chen, Z Hong, G Xie, Q Peng, X You… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Zero-shot learning (ZSL) tackles the unseen class recognition problem by transferring
semantic knowledge from seen classes to unseen ones. Typically, to guarantee desirable …

Ingredient prediction via context learning network with class-adaptive asymmetric loss

M Luo, W Min, Z Wang, J Song… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Ingredient prediction has received more and more attention with the help of image
processing for its diverse real-world applications, such as nutrition intake management and …

Double attention based on graph attention network for image multi-label classification

W Zhou, Z Xia, P Dou, T Su, H Hu - ACM Transactions on Multimedia …, 2023 - dl.acm.org
The task of image multi-label classification is to accurately recognize multiple objects in an
input image. Most of the recent works need to leverage the label co-occurrence matrix …

Accurate and efficient large-scale multi-label learning with reduced feature broad learning system using label correlation

J Huang, CM Vong, CLP Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Multi-label learning for large-scale data is a grand challenge because of a large number of
labels with a complex data structure. Hence, the existing large-scale multi-label methods …

Graph attention transformer network for multi-label image classification

J Yuan, S Chen, Y Zhang, Z Shi, X Geng… - ACM Transactions on …, 2023 - dl.acm.org
Multi-label classification aims to recognize multiple objects or attributes from images. The
key to solving this issue relies on effectively characterizing the inter-label correlations or …