Graph neural networks in vision-language image understanding: A survey

H Senior, G Slabaugh, S Yuan, L Rossi - The Visual Computer, 2024 - Springer
2D image understanding is a complex problem within computer vision, but it holds
the key to providing human-level scene comprehension. It goes further than identifying the …

Automatic image caption generation using deep learning

A Verma, AK Yadav, M Kumar, D Yadav - Multimedia Tools and …, 2024 - Springer
Image captioning is an interesting and challenging task with applications in diverse domains
such as image retrieval, organizing and locating images of users' interest, etc. It has huge …

Implementation and optimization of image processing algorithm using machine learning and image compression

G Zacharis, G Gadounas, P Tsirtsakis… - SHS Web of …, 2022 - shs-conferences.org
This research paper deals with the implementation of an image captioning algorithm using
TensorFlow, Keras, and Python, as well as a way proposed for optimization, using image …

Cross-language multimodal scene semantic guidance and leap sampling for video captioning

B Sun, Y Wu, Y Zhao, Z Hao, L Yu, J He - The Visual Computer, 2023 - Springer
In recent years, video captioning, which uses natural language to describe video content,
has achieved encouraging results. However, most of the previous studies in this area have …

Multi-channel weighted fusion for image captioning

J Zhong, Y Cao, Y Zhu, J Gong, Q Chen - The Visual Computer, 2023 - Springer
Automatically describing the detail and content of the image is a meaningful but difficult task.
In this paper, we propose a variety of optimization improvements to enhance the encoder …

GAF-Net: Global view guided attribute fusion network for remote sensing image captioning

Y Peng, Y Jia, J Chen, X Ji - Multimedia Tools and Applications, 2024 - Springer
Remote sensing image captioning is a comprehensive task in the field of image captioning
and remote sensing, and it is an emerging research hotspot in the deep learning field. At …

An Efficient Deep Learning based Hybrid Model for Image Caption Generation

M Kaur, H Kaur - … Journal of Advanced Computer Science and …, 2023 - search.proquest.com
In recent years, with the increase in the use of different social media platforms, image
captioning approaches play a major role in automatically describing the whole image into …

Structured Encoding Based on Semantic Disambiguation for Video Captioning

B Sun, J Tian, Y Wu, L Yu, Y Tang - Cognitive Computation, 2024 - Springer
Video captioning, which aims to automatically generate video captions, has gained
significant attention due to its wide range of applications in video surveillance and retrieval …

Image Caption Generation Using Deep Learning Algorithm

K Gupta, D Goyal, SK Mishra - Educational Administration: Theory and …, 2024 - kuey.net
This study investigates the effectiveness of an image captioning model utilizing VGG16 and
LSTM architectures on the Flickr8K dataset. Through meticulous experimentation and …

Deep Learning Hybrid Technique for Generation of Image Caption

N Rakshith, MG BK, N Preetham… - … Conference on Signal …, 2024 - ieeexplore.ieee.org
Image captioning is a fascinating and demanding work with applications in many different
fields, including image retrieval, organizing and finding user-interested images, etc. It has …