LGR-NET: Language Guided Reasoning Network for Referring Expression Comprehension

文章

学术资源搜索

获得 4 条结果（用时0.06秒）

我的图书馆

LGR-NET: Language Guided Reasoning Network for Referring Expression Comprehension

在引用文章中搜索

[PDF] thecvf.com

Revisiting Counterfactual Problems in Referring Expression Comprehension

Z Yu, R Li - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Traditional referring expression comprehension (REC) aims to locate the target referent in
an image guided by a text query. Several previous methods have studied on the …

被引用次数：2 相关文章

A Masked Reference Token Supervision based Iterative Visual-language Framework for Robust Visual Grounding

C Wang, W Feng, S Lyu, G Cheng, X Li… - … on Circuits and …, 2024 - ieeexplore.ieee.org

Visual Grounding (VG) has become a prominent task in recent years, achieving significant
advancements with the development of detection and vision transformers. However, existing …

[PDF] arxiv.org

ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding

M Zheng, J Zhang, Q Chen, Y Peng, Y Liu - arXiv preprint arXiv …, 2024 - arxiv.org

Visual grounding aims to localize the object referred to in an image based on a natural
language query. Although progress has been made recently, accurately localizing target …

MFSC: A Multimodal Aspect-Level Sentiment Classification Framework with Multi-Image Gate and Fusion Networks

L Zi, X Pan, X Cong - Electronics, 2024 - mdpi.com

Currently, there is a great deal of interest in multimodal aspect-level sentiment classification
using both textual and visual information, which changes the traditional use of only single …

高级搜索

QQ 群

LGR-NET: Language Guided Reasoning Network for Referring Expression Comprehension

Revisiting Counterfactual Problems in Referring Expression Comprehension

A Masked Reference Token Supervision based Iterative Visual-language Framework for Robust Visual Grounding

ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding

MFSC: A Multimodal Aspect-Level Sentiment Classification Framework with Multi-Image Gate and Fusion Networks

引用