Transcribe3d: Grounding llms using transcribed information for 3d referential reasoning with...

文章

学术资源搜索

获得 2 条结果（用时0.07秒）

我的图书馆

Transcribe3d: Grounding llms using transcribed information for 3d referential reasoning with...

在引用文章中搜索

[PDF] arxiv.org

When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

X Ma, Y Bhalgat, B Smart, S Chen, X Li, J Ding… - arXiv preprint arXiv …, 2024 - arxiv.org

As large language models (LLMs) evolve, their integration with 3D spatial data (3D-LLMs)
has seen rapid progress, offering unprecedented capabilities for understanding and …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions

D Liu, Y Liu, W Huang, W Hu - arXiv preprint arXiv:2406.05785, 2024 - arxiv.org

Text-guided 3D visual grounding (T-3DVG), which aims to locate a specific object that
semantically corresponds to a language query from a complicated 3D scene, has drawn …

高级搜索

QQ 群

Transcribe3d: Grounding llms using transcribed information for 3d referential reasoning with...

When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions

引用