Look, remember and reason: Visual reasoning with grounded rationales

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

我的图书馆

Look, remember and reason: Visual reasoning with grounded rationales

在引用文章中搜索

[PDF] tandfonline.com

GeoAI in urban analytics

S De Sabbata, A Ballatore, HJ Miller… - International Journal …, 2023 - Taylor & Francis

We are writing this editorial piece at the peak of the current Artificial Intelligence
(AI)'spring'as generative models quickly cross the bridge from the confines of academic and …

被引用次数：11 相关文章所有 5 个版本

[PDF] thecvf.com

Painter: Teaching Auto-regressive Language Models to Draw Sketches

R Pourreza, A Bhattacharyya… - Proceedings of the …, 2023 - openaccess.thecvf.com

Large language models (LLMs) have made tremendous progress in natural language
understanding and they have also been successfully adopted in other domains such as …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Live Fitness Coaching as a Testbed for Situated Interaction

S Panchal, A Bhattacharyya, G Berger… - arXiv preprint arXiv …, 2024 - arxiv.org

Tasks at the intersection of vision and language have had a profound impact in advancing
the capabilities of vision-language models such as dialog-based assistants. However …

Top-down Activity Representation Learning for Video Question Answering

Y Wang, S Haruta, D Zeng, J Vizcarra… - arXiv preprint arXiv …, 2024 - arxiv.org

Capturing complex hierarchical human activities, from atomic actions (eg, picking up one
present, moving to the sofa, unwrapping the present) to contextual events (eg, celebrating …

高级搜索

QQ 群

Look, remember and reason: Visual reasoning with grounded rationales

GeoAI in urban analytics

Painter: Teaching Auto-regressive Language Models to Draw Sketches

Live Fitness Coaching as a Testbed for Situated Interaction

Top-down Activity Representation Learning for Video Question Answering

引用