Generating image captions using Bahdanau attention mechanism and transfer learning

S Ayoub, Y Gulzar, FA Reegu, S Turaev - Symmetry, 2022 - mdpi.com
Automatic image caption prediction is a challenging task in natural language processing.
Most of the researchers have used the convolutional neural network as an encoder and …

Advancing image captioning with V16HP1365 encoder and dual self-attention network

T Jaiswal, M Pandey, P Tripathi - Multimedia Tools and Applications, 2024 - Springer
Image captioning generates textual description from the corresponding input image with the
help of computer vision and natural language processing. In recent years, deep learning …

Enhanced Image Captioning Using Bahdanau Attention Mechanism and Heuristic Beam Search Algorithm

S Abinaya, M Deepak, AS Alphonse - IEEE Access, 2024 - ieeexplore.ieee.org
Captioning images is a challenging task at the intersection of Computer Vision (CV) and
Natural Language Processing (NLP), that involves generating descriptive text to depict the …

Comparitive study of GRU and LSTM cells based Video Captioning Models

H Maru, T Chandana, D Naik - 2021 12th International …, 2021 - ieeexplore.ieee.org
Video Captioning task involves generating descriptive text for the events and objects in the
videos. It mainly involves taking a video, which is nothing but a sequence of frames, as data …

Comparison of Deep Learning Models for Automatic Image Descriptors

L Agarwal, B Verma - 2023 IEEE 20th India Council …, 2023 - ieeexplore.ieee.org
Image description is a task which combines the methods like Natural Language Processing,
Artificial Intelligence and Computer Vision, which aims to generate contextually and …

Siamese-Driven Optimization for Low-Resolution Image Latent Embedding in Image Captioning

JJ Tan, A Mokraoui, BH Kwan… - 2024 Signal …, 2024 - ieeexplore.ieee.org
Image captioning is essential in many fields including assisting visually impaired individuals,
improving content management systems, and enhancing human-computer interaction …

Attention based Image Captioning using Depth-wise Separable Convolution

VR Mallick, D Naik - 2021 12th International Conference on …, 2021 - ieeexplore.ieee.org
Automatically generating descriptions for an image has been one of the trending topics in
the field of Computer Vision. This is due to the fact that various real-life applications like self …

Automated Image Annotation with Voice Synthesis Using Machine Learning

PY Prasad, M Ramu, IG Priya, KS Sree… - 2024 2nd World …, 2024 - ieeexplore.ieee.org
The goal of our project is to develop automated image annotator with voice synthesis using
machine learning that is developed with help of CNN (Convolutional Neural Network) and …

Loss Optimised Video Captioning using Deep-Lstm, Attention Mechanism and Weighted Loss Metrices

N Yadav, D Naik - 2021 12th International Conference on …, 2021 - ieeexplore.ieee.org
The aim of the video captioning task is to use multiple natural-language sentences to define
video content. Photographic, graphical, and auditory data are all used in the videos. Our …

Weakly Supervised Image Annotation and Segmentation

D Naik, CD Jaidhar - 2021 12th International Conference on …, 2021 - ieeexplore.ieee.org
The various aspects in the processing of an image include object recognition, object
classification, image segmentation, and attribute learning, are closely related to each other …