From easy to hard: Learning language-guided curriculum for visual question answering on remote sensing data

Z Yuan, L Mou, Q Wang, XX Zhu - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Visual question answering (VQA) for remote sensing scenes has great potential in intelligent
human–computer interaction systems. Although VQA in computer vision has been widely …

A survey of language processing methods for visual question answering.

王瑞平, 吴士泓, 张美航… - Journal of Computer …, 2022 - search.ebscohost.com
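Language processing methods have a major impact on the performance of visual question answering models. These methods originate from natural language processing,
but over the course of their development they have fallen out of step with the state of the art in that field, which has left the question understanding and answer …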

Logically Consistent Loss for Visual Question Answering

AC Le-Ngo, T Tran, S Rana, S Gupta… - arXiv preprint arXiv …, 2020 - arxiv.org
Given an image, background knowledge, and a set of questions about an object, human
learners answer the questions very consistently regardless of question forms and semantic …