Video description: A comprehensive survey of deep learning approaches

G Rafiq, M Rafiq, GS Choi - Artificial Intelligence Review, 2023 - Springer
Video description refers to understanding visual content and transforming that acquired
understanding into automatic textual narration. It bridges the key AI fields of computer vision …

Evaluation metrics for video captioning: A survey

A de Souza Inácio, HS Lopes - Machine Learning with Applications, 2023 - Elsevier
Automatic evaluation metrics play an important role in assessing video captioning systems.
Popular metrics used for assessing such approaches are based on word matching and may …

SoccerNet-caption: Dense video captioning for soccer broadcasts commentaries

H Mkhallati, A Cioppa, S Giancola… - Proceedings of the …, 2023 - openaccess.thecvf.com
Soccer is more than just a game; it is a passion that transcends borders and unites people
worldwide. From the roar of the crowds to the excitement of the commentators, every …

Exploring deep learning approaches for video captioning: A comprehensive review

AJ Yousif, MH Al-Jammas - e-Prime-Advances in Electrical Engineering …, 2023 - Elsevier
While humans can easily describe visual data at varying levels of detail, the same task
presents a significant challenge for machines. This challenge becomes even more complex …

Bilingual video captioning model for enhanced video retrieval

N Alrebdi, AA Al-Shargabi - Journal of Big Data, 2024 - Springer
Many video platforms rely on the descriptions that uploaders provide for video retrieval.
However, this reliance may cause inaccuracies. Although deep learning-based video …

DeepRide: Dashcam video description dataset for autonomous vehicle location-aware trip description

G Rafiq, M Rafiq, BW On, M Sung, GS Choi - IEEE Access, 2022 - ieeexplore.ieee.org
Video description is one of the most challenging tasks in the combined domain of computer
vision and natural language processing. Captions for various open and constrained domain …

Spectral representation learning and fusion for autonomous vehicles trip description exploiting recurrent transformer

G Rafiq, M Rafiq, GS Choi - IEEE Access, 2023 - ieeexplore.ieee.org
A thorough analysis and comprehension of the entire cue set in visual data are
indispensable for an ideal video description model. As outlined in recent algorithm …

Video Annotation & Descriptions using Machine Learning & Deep learning: Critical Survey of methods

P Kaushik, V Saxena - Proceedings of the 2023 Fifteenth International …, 2023 - dl.acm.org
Video description methods aim to produce the most relevant description of a video. This
could be a description based on the full video, on individual frames, or on important events of the …

Confiner Based Video Captioning Model

J Vaishnavi, V Narmatha - IOP Conference Series: Materials …, 2022 - iopscience.iop.org
Video captioning is the interesting task of encoding features and decoding the encoded
features into natural language. The video captioning task is the perfect blend of computer vision …

Video Captcha Proposition based on VQA, NLP, Deep Learning and Computer Vision

E Johri, L Dharod, R Joshi, S Kulkarni… - 2022 5th International …, 2022 - ieeexplore.ieee.org
Visual Question Answering or VQA is a technique used in diverse domains ranging from
simple visual questions and answers on short videos to security. Here in this paper, we talk …