P Qiang, H Tan, X Li, D Wang, R Li, X Sun, H Zhang… - Neurocomputing, 2025 - Elsevier
Current state-of-the-art (SOTA) KB-VQA techniques involve transforming images into image
captions as prompts to harness the potent reasoning capabilities of large language models …