所有版本 - 学术资源搜索

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

Pivot: Iterative visual prompting elicits actionable knowledge for vlms

S Nasiriany, F Xia, W Yu, T Xiao, J Liang… - arXiv preprint arXiv …, 2024 - arxiv.org

Vision language models (VLMs) have shown impressive capabilities across a variety of
tasks, from logical reasoning to visual understanding. This opens the door to richer …

被引用次数：20 相关文章

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

S Nasiriany, F Xia, W Yu, T Xiao, J Liang… - Forty-first International … - openreview.net

Vision language models (VLMs) have shown impressive capabilities across a variety of
tasks, from logical reasoning to visual understanding. This opens the door to richer …

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

S Nasiriany, F Xia, W Yu, T Xiao, J Liang… - arXiv e …, 2024 - ui.adsabs.harvard.edu

Vision language models (VLMs) have shown impressive capabilities across a variety of
tasks, from logical reasoning to visual understanding. This opens the door to richer …

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

S Nasiriany, F Xia, W Yu, T Xiao, J Liang… - First Workshop on Vision … - openreview.net

Vision language models (VLMs) have shown impressive capabilities across a variety of
tasks, from logical reasoning to visual understanding. This opens the door to richer …

高级搜索

QQ 群

Pivot: Iterative visual prompting elicits actionable knowledge for vlms

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

引用