A Singh, TD Singh, S Bandyopadhyay - arXiv preprint arXiv:2011.14752, 2020 - arxiv.org
Video description involves the generation of the natural language description of actions, events, and objects in the video. There are various applications of video description by filling …
AH Raj, A Seum, A Dash, S Islam… - 2021 26th International …, 2021 - ieeexplore.ieee.org
Generating meaningful textual descriptions from visual contents having the context in consideration is very challenging in terms of Natural Language Processing (NLP) and …
Humans are faced with a constant flow of visual stimuli, eg, from the environment or when looking at social media. In contrast, visually-impaired people are often incapable to perceive …