LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning

S Chen, X Chen, C Zhang, M Li, G Yu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Recent progress in Large Multimodal Models (LMM) has opened up great
possibilities for various applications in the field of human-machine interactions. However …

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

S Chen, X Chen, C Zhang, M Li, G Yu, H Fei… - arXiv e …, 2023 - ui.adsabs.harvard.edu
Abstract Recent advances in Large Multimodal Models (LMM) have made it possible for
various applications in human-machine interactions. However, developing LMMs that can …

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

S Chen, X Chen, C Zhang, M Li, G Yu, H Fei… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advances in Large Multimodal Models (LMM) have made it possible for various
applications in human-machine interactions. However, developing LMMs that can …