Zero-shot visual reasoning by vision-language models: Benchmarking and analysis

2 results (0.02 seconds)

[PDF] arxiv.org

Synthetic Vision: Training Vision-Language Models to Understand Physics

V Balazadeh, M Ataei, H Cheong… - arXiv preprint arXiv …, 2024 - arxiv.org

Physical reasoning, which involves the interpretation, understanding, and prediction of
object behavior in dynamic environments, remains a significant challenge for current Vision …

Cited by: 1

[PDF] arxiv.org

BehAV: Behavioral Rule Guided Autonomy Using VLMs for Robot Navigation in Outdoor Scenes

K Weerakoon, M Elnoor, G Seneviratne… - arXiv preprint arXiv …, 2024 - arxiv.org

We present BehAV, a novel approach for autonomous robot navigation in outdoor scenes
guided by human instructions and leveraging Vision Language Models (VLMs). Our method …
