Referring multi-object tracking

D Wu, W Han, T Wang, X Dong… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing referring understanding tasks tend to involve the detection of a single text-referred
object. In this paper, we propose a new and general referring understanding task, termed …

Cross-modality synergy network for referring expression comprehension and segmentation

Q Li, Y Zhang, S Sun, J Wu, X Zhao, M Tan - Neurocomputing, 2022 - Elsevier
Referring expression comprehension and segmentation aim to locate and segment a
referred instance in an image according to a natural language expression. However …

Composed image retrieval via explicit erasure and replenishment with semantic alignment

G Zhang, S Wei, H Pang, S Qiu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Composed image retrieval aims at retrieving the desired images, given a reference image
and a text piece. To handle this task, two important subprocesses should be modeled …

Enhance Composed Image Retrieval via Multi-Level Collaborative Localization and Semantic Activeness Perception

G Zhang, S Wei, H Pang, S Qiu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Composed image retrieval (CIR) is an emerging and challenging research task that
combines two modalities, a reference image, and a modification text, into one query to …

Language-Conditioned Feature Pyramids for Visual Selection Tasks

T Iki, A Aizawa - Findings of the Association for Computational …, 2020 - aclanthology.org
Referring expression comprehension, which is the ability to locate language to an object in
an image, plays an important role in creating common ground. Many models that fuse visual …