Referring image segmentation has sprung up benefiting from the outstanding performance of deep neural networks. However, most existing methods explore either local details or the …
H Li, M Sun, J Xiao, EG Lim… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Referring Expression Segmentation (RES), which is aimed at localizing and segmenting the target according to the given language expression, has drawn increasing attention. Existing …
Referring expression comprehension aims to localize a specific object in an image according to a given language description. It is still challenging to comprehend and mitigate …
Referring Image Segmentation (RIS) aims to extract the object or stuff from an image according to the given natural language expression. As a representative multi-modal …
W Zhang, Q Tan, P Li, Q Zhang, R Wang - Neurocomputing, 2023 - Elsevier
Referring image segmentation (RIS) aims to predict a segmentation mask for a target specified by a natural language expression. However, the existing methods failed to …
Y Zhang, Q Li, Y Pan, X Zhao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Video-based referring expression comprehension is a challenging task that requires locating the referred object in each video frame of a given video. While many existing …
H Zhang, L Wang, S Li, K Xu, B Yin - Neurocomputing, 2024 - Elsevier
Referring image segmentation aims to segment the instance corresponding to the given language description, which requires aligning information from two modalities. Existing …
Referring expression grounding is an important and challenging task in computer vision. To avoid the laborious annotation in conventional referring grounding, unpaired referring …
C Xie, Z Zhang, Y Wu, F Zhu, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Detecting objects based on language descriptions is a popular task that includes Open-Vocabulary object Detection (OVD) and Referring Expression Comprehension (REC). In this …