A mobile device framework for video captioning using multimodal neural networks

RJP Damaceno, RM Cesar Jr - Anais Estendidos do XXXVI …, 2023 - sol.sbc.org.br
Video captioning is a computer vision task aimed at providing textual descriptions for videos.
There are numerous strategies and datasets that can be employed to create models capable …

An End-to-End Deep Learning Approach for Video Captioning Through Mobile Devices

RJ Pezzuto Damaceno, RM Cesar Jr - Iberoamerican Congress on Pattern …, 2023 - Springer
Video captioning is a computer vision task that aims at generating a description for video
content. This can be achieved using deep learning approaches that leverage image and …

Custom CNN-BiLSTM model for video captioning

AR Chougule, SD Chavan - Multimedia Tools and Applications, 2024 - Springer
This paper introduces a video captioning model that integrates spatial and temporal feature
extraction methods to produce comprehensive textual descriptions for videos. The …

A multimodal framework for video caption generation

RS Bhooshan, K Suresh - IEEE Access, 2022 - ieeexplore.ieee.org
Video captioning is a highly challenging computer vision task that automatically describes
the video clips using natural language sentences with a clear understanding of the …

Multimodal feature learning for video captioning

S Lee, I Kim - Mathematical Problems in Engineering, 2018 - Wiley Online Library
Video captioning refers to the task of generating a natural language sentence that explains
the content of the input video clips. This study proposes a deep neural network model for …

A Review Of Video Captioning Methods

D Mahajan, S Bhosale, Y Nighot… - International Journal of …, 2021 - search.ebscohost.com
Video captioning is the process of creating a natural language sentence that summarises
the video's contents automatically. Modeling the video's effective temporal composition and …

Efficient video captioning with frame similarity-based filtering

E Rashno, F Zulkernine - … Conference on Database and Expert Systems …, 2023 - Springer
Video captioning combines computer vision and Natural Language Processing (NLP) to
perform the challenging task of scene understanding. The rapid advancements in artificial …

Video Captioning Using Large Language Models

P Malaviya, D Patel, S Bharti - 2024 3rd International …, 2024 - ieeexplore.ieee.org
In this research, we delve into the intricate domain of video captioning, a pivotal aspect of
multimedia content analysis, by harnessing the synergistic potential of convolutional and …

Exploring deep learning approaches for video captioning: A comprehensive review

AJ Yousif, MH Al-Jammas - e-Prime-Advances in Electrical Engineering …, 2023 - Elsevier
While humans can easily describe visual data at varying levels of detail, the same task
presents a significant challenge for machines. This challenge becomes even more complex …

An efficient deep learning‐based video captioning framework using multi‐modal features

S Varma, DP James - Expert Systems, 2021 - Wiley Online Library
Visual understanding has become more significant in gathering information in many real‐life
applications. For a human, it is a trivial task to understand the content in a visual, however …