F Wang, B Wang, F Xu, J Li, P Liu - International Conference on …, 2023 - Springer
Abstract In Visual Question Answering (VQA) task, extracting semantic information from
multimodalities and effectively utilizing this information for interaction is crucial. Existing VQA …