Recently, visual question answering (VQA) has gained considerable interest within the computer vision and natural language processing (NLP) research areas. The VQA task …
The aim of the image captioning task is to understand various semantic concepts such as objects and their relationships in an image and combine them to generate a natural …
AA Saadi, A Soukane, Y Meraihi, AB Gabis… - IEEE …, 2023 - ieeexplore.ieee.org
The concept of smart cities is to enhance the life quality of residents and provide efficient services by integrating advanced information and communication technologies, autonomous …
H Sharma, AS Jalal - Multimedia Tools and Applications, 2022 - Springer
The text present in natural scenes contains semantic information about its surrounding environment. For example, the majority of questions asked by blind people related to images …
H Sharma, S Srivastava - Journal of Electronic Imaging, 2022 - spiedigitallibrary.org
With the remarkable success of the image captioning tasks, visual attention methods have become a vital part of captioning models. However, most attention-based image captioning …
H Sharma, S Srivastava - Neural Processing Letters, 2023 - Springer
Understanding different semantic concepts, such as objects and their relationships in an image, and integrating them to produce a natural language description is the goal of the …
S Srivastava, H Sharma, P Dixit - 2022 2nd International …, 2022 - ieeexplore.ieee.org
Image captioning is a challenging task that needs the knowledge from both computer vision algorithms and language processing techniques. The model must be able to understand an …
W Tian, H Li, ZQ Zhao - … of the 29th International Conference on …, 2022 - aclanthology.org
Abstract A Visual Question Answering (VQA) model processes images and questions simultaneously with rich semantic information. The attention mechanism can highlight fine …
ABSTRACT Scene Text Visual Question Answering (VQA) needs to understand both the visual contents and the texts in an image to predict an answer for the image-related …