MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

Y Tang, X Han, X Li, Q Yu, Y Hao, L Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large 2D vision-language models (2D-LLMs) have gained significant attention by bridging
Large Language Models (LLMs) with images using a simple projector. Inspired by their …

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

Y Tang, X Han, X Li, Q Yu, Y Hao, L Hu… - arXiv e …, 2024 - ui.adsabs.harvard.edu