Bi-directional relationship inferring network for referring image segmentation

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

被引用次数：153 相关文章所有 5 个版本

[PDF] thecvf.com

Gres: Generalized referring expression segmentation

C Liu, H Ding, X Jiang - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com

Abstract Referring Expression Segmentation (RES) aims to generate a segmentation mask
for the object described by a given language expression. Existing classic RES datasets and …

被引用次数：142 相关文章所有 6 个版本

[PDF] arxiv.org

Scaling open-vocabulary image segmentation with image-level labels

G Ghiasi, X Gu, Y Cui, TY Lin - European Conference on Computer Vision, 2022 - Springer

We design an open-vocabulary image segmentation model to organize an image into
meaningful regions indicated by arbitrary texts. Recent works (CLIP and ALIGN), despite …

被引用次数：443 相关文章所有 5 个版本

[PDF] thecvf.com

Cris: Clip-driven referring image segmentation

Z Wang, Y Lu, Q Li, X Tao, Y Guo… - Proceedings of the …, 2022 - openaccess.thecvf.com

Referring image segmentation aims to segment a referent via a natural linguistic expression.
Due to the distinct data properties between text and image, it is challenging for a network to …

被引用次数：371 相关文章所有 7 个版本

[PDF] thecvf.com

Lavt: Language-aware vision transformer for referring image segmentation

Z Yang, J Wang, Y Tang, K Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Referring image segmentation is a fundamental vision-language task that aims to segment
out an object referred to by a natural language expression from an image. One of the key …

被引用次数：320 相关文章所有 10 个版本

[PDF] thecvf.com

Bridging vision and language encoders: Parameter-efficient tuning for referring image segmentation

Z Xu, Z Chen, Y Zhang, Y Song… - Proceedings of the …, 2023 - openaccess.thecvf.com

Parameter efficient tuning (PET) has received considerable attention owing to its
applicability to reduce the number of parameters that need to be updated while maintaining …

被引用次数：62 相关文章所有 5 个版本

[PDF] thecvf.com

Restr: Convolution-free referring image segmentation using transformers

N Kim, D Kim, C Lan, W Zeng… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Referring image segmentation is an advanced semantic segmentation task where target is
not a predefined class but is described in natural language. Most of existing methods for this …

被引用次数：159 相关文章所有 7 个版本

[PDF] thecvf.com

Vision-language transformer and query generation for referring segmentation

H Ding, C Liu, S Wang, X Jiang - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

In this work, we address the challenging task of referring segmentation. The query
expression in referring segmentation typically indicates the target object by describing its …

被引用次数：269 相关文章所有 7 个版本

[PDF] arxiv.org

VLT: Vision-language transformer and query generation for referring segmentation

H Ding, C Liu, S Wang, X Jiang - IEEE Transactions on Pattern …, 2022 - ieeexplore.ieee.org

We propose a Vision-Language Transformer (VLT) framework for referring segmentation to
facilitate deep interactions among multi-modal information and enhance the holistic …

被引用次数：120 相关文章所有 7 个版本

[PDF] arxiv.org

Seqtr: A simple yet universal network for visual grounding

C Zhu, Y Zhou, Y Shen, G Luo, X Pan, M Lin… - … on Computer Vision, 2022 - Springer

In this paper, we propose a simple yet universal network termed SeqTR for visual grounding
tasks, eg, phrase localization, referring expression comprehension (REC) and segmentation …

被引用次数：148 相关文章所有 5 个版本

高级搜索

QQ 群