Deep learning and knowledge graph for image/video captioning: A review of datasets, evaluation metrics, and methods

MS Wajid, H Terashima‐Marin, P Najafirad… - Engineering …, 2024 - Wiley Online Library
Generating an image/video caption has always been a fundamental problem of Artificial
Intelligence, which is usually performed using the potential of Deep Learning Methods …

Insights into Object Semantics: Leveraging Transformer Networks for Advanced Image Captioning

D Abdal Hafeth, S Kollias - Sensors, 2024 - mdpi.com
Image captioning is a technique used to generate descriptive captions for images. Typically,
it involves employing a Convolutional Neural Network (CNN) as the encoder to extract visual …

AFSDCGN: Adaptive Feature Scaling and Dynamic Contextual Graph Networks for image captioning with unseen relationship detection

YA Thakare, KH Walse, M Atique - Multimedia Tools and Applications, 2024 - Springer
Automated image captioning systems play a crucial role in various applications such as
assistive technologies, content indexing, and robotics. However, current frameworks face …

Cloud-IoT Application for Scene Understanding in Assisted Living: Unleashing the Potential of Image Captioning and Large Language Model (ChatGPT)

DA Hafeth, G Lal, M Al-Khafajiy… - … on Developments in …, 2023 - ieeexplore.ieee.org
Vision is a vital sense that plays a pivotal role in our understanding of the world. The majority
of our external information is acquired through our visual system, which significantly impacts …

Evolution of Image Captioning Models: An Overview

A Saouabe, S Tkatek, M Mazar… - 2023 10th International …, 2023 - ieeexplore.ieee.org
This article presents a state-of-the-art review of image captioning methodologies developed
in the past five years. Image captioning, which aims to generate text describing the visual …

[PDF][PDF] Machine Learning for Forecasting: A Comparative Analysis

K Papadakis - 2024 - dspace.lib.ntua.gr
This thesis investigates the performance of advanced machine learning models for time
series forecasting. Prophet, N-BEATS, DeepAR, DeepVAR, and the Temporal Fusion …

Voice Enabled Deep Learning Based Image Captioning Solution for Guided Navigation

S Kesavan, J Jayakumar, AK KS… - 2023 International …, 2023 - ieeexplore.ieee.org
The use of technology to assist visually impaired individuals is crucial in addressing the
global issue of vision impairment. Worldwide more than billion people suffer from a vision …