Y Wu, Z Zhang, X Chi, F Zhu, R Zhao - arXiv preprint arXiv:2305.12452, 2023 - arxiv.org
Referring Expression Segmentation (RES) is a widely explored multi-modal task, which
endeavors to segment the pre-existing object within a single image with a given linguistic …