Image captioning improved visual question answering

H Sharma, AS Jalal - Multimedia tools and applications, 2022 - Springer
Both Visual Question Answering (VQA) and image captioning are the problems
which involve Computer Vision (CV) and Natural Language Processing (NLP) domains. In …

Generating question relevant captions to aid visual question answering

J Wu, Z Hu, RJ Mooney - arXiv preprint arXiv:1906.00513, 2019 - arxiv.org
Visual question answering (VQA) and image captioning require a shared body of general
knowledge connecting language and vision. We present a novel approach to improve VQA …

Inner knowledge-based Img2Doc scheme for visual question answering

Q Li, F Xiao, B Bhanu, B Sheng, R Hong - ACM Transactions on …, 2022 - dl.acm.org
Visual Question Answering (VQA) is a research topic of significant interest at the intersection
of computer vision and natural language understanding. Recent research indicates that …

More than an answer: Neural pivot network for visual question answering

Y Zhou, R Ji, J Su, Y Wu, Y Wu - Proceedings of the 25th ACM …, 2017 - dl.acm.org
Most of existing works in visual question answering (VQA) are dedicated to improving the
performance of answer predictions, while leaving the explanation of answering unexploited …

An improved attention and hybrid optimization technique for visual question answering

H Sharma, AS Jalal - Neural Processing Letters, 2022 - Springer
In Visual Question Answering (VQA), an attention mechanism has a critical role in
specifying the different objects present in an image or tells the machine where to focus by …

Visual question answering: Which investigated applications?

S Barra, C Bisogni, M De Marsico, S Ricciardi - Pattern Recognition Letters, 2021 - Elsevier
Visual Question Answering (VQA) is an extremely stimulating and challenging
research area where Computer Vision (CV) and Natural Language Processing (NLP) have …

Visual question answering model based on graph neural network and contextual attention

H Sharma, AS Jalal - Image and Vision Computing, 2021 - Elsevier
Visual Question Answering (VQA) has recently appeared as a hot research area in
the field of computer vision and natural language processing. A VQA model uses both image …

See and learn more: Dense caption-aware representation for visual question answering

Y Bi, H Jiang, Y Hu, Y Sun, B Yin - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the rapid development of deep learning models, great improvements have been
achieved in the Visual Question Answering (VQA) field. However, modern VQA models are …

A sequence-to-sequence model approach for ImageCLEF 2018 medical domain visual question answering

R Ambati, CR Dudyala - 2018 15th IEEE India Council …, 2018 - ieeexplore.ieee.org
Numerous attempts have been made in the recent past for the task of free-form and
open-ended Visual Question Answering (VQA). Solving VQA problem typically requires …

Improving visual question answering by referring to generated paragraph captions

H Kim, M Bansal - arXiv preprint arXiv:1906.06216, 2019 - arxiv.org
Paragraph-style image captions describe diverse aspects of an image as opposed to the
more common single-sentence captions that only provide an abstract description of the …