Video description: Datasets & evaluation metrics

G Rafiq, M Rafiq, GS Choi - Artificial Intelligence Review, 2023 - Springer

Video description refers to understanding visual content and transforming that acquired
understanding into automatic textual narration. It bridges the key AI fields of computer vision …

被引用次数：16 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] Evaluation metrics for video captioning: A survey

A de Souza Inácio, HS Lopes - Machine Learning with Applications, 2023 - Elsevier

Automatic evaluation metrics play an important role in assessing video captioning systems.
Popular metrics used for assessing such approaches are based on word matching and may …

被引用次数：5 相关文章

[PDF] thecvf.com

SoccerNet-caption: Dense video captioning for soccer broadcasts commentaries

H Mkhallati, A Cioppa, S Giancola… - Proceedings of the …, 2023 - openaccess.thecvf.com

Soccer is more than just a game-it is a passion that transcends borders and unites people
worldwide. From the roar of the crowds to the excitement of the commentators, every …

被引用次数：16 相关文章所有 8 个版本

[HTML] sciencedirect.com

[HTML][HTML] Exploring deep learning approaches for video captioning: A comprehensive review

AJ Yousif, MH Al-Jammas - e-Prime-Advances in Electrical Engineering …, 2023 - Elsevier

While humans can easily describe visual data at varying levels of detail, the same task
presents a significant challenge for machines. This challenge becomes even more complex …

被引用次数：4 相关文章所有 2 个版本

[PDF] springer.com

Bilingual video captioning model for enhanced video retrieval

N Alrebdi, AA Al-Shargabi - Journal of Big Data, 2024 - Springer

Many video platforms rely on the descriptions that uploaders provide for video retrieval.
However, this reliance may cause inaccuracies. Although deep learning-based video …

被引用次数：1 相关文章所有 7 个版本

[PDF] ieee.org

DeepRide: Dashcam video description dataset for autonomous vehicle location-aware trip description

G Rafiq, M Rafiq, BW On, M Sung, GS Choi - IEEE Access, 2022 - ieeexplore.ieee.org

Video description is one of the most challenging task in the combined domain of computer
vision and natural language processing. Captions for various open and constrained domain …

被引用次数：4 相关文章所有 4 个版本

[PDF] ieee.org

Spectral representation learning and fusion for autonomous vehicles trip description exploiting recurrent transformer

G Rafiq, M Rafiq, GS Choi - IEEE Access, 2023 - ieeexplore.ieee.org

A thorough analysis and comprehension of the entire cue set in visual data are
indispensable for an ideal video description model. As outlined in recent algorithm …

被引用次数：1 相关文章所有 3 个版本

Video Annotation & Descriptions using Machine Learning & Deep learning: Critical Survey of methods

P Kaushik, V Saxena - Proceedings of the 2023 Fifteenth International …, 2023 - dl.acm.org

Video description methods aim to produce the most relevant description of a video. This
could be description based on full video, frame based or based on important events of the …

[PDF] iop.org

Confiner Based Video Captioning Model

J Vaishnavi, V Narmatha - IOP Conference Series: Materials …, 2022 - iopscience.iop.org

Video captioning is the interesting task of encoding features and decoding the encoded
features into natural language. Video captioning task is the perfect blend of computer vision …

Video Captcha Proposition based on VQA, NLP, Deep Learning and Computer Vision

E Johri, L Dharod, R Joshi, S Kulkarni… - 2022 5th International …, 2022 - ieeexplore.ieee.org

Visual Question Answering or VQA is a technique used in diverse domains ranging from
simple visual questions and answers on short videos to security. Here in this paper, we talk …

高级搜索

QQ 群