查看文章

作者

Mohammad Saif Wajid, Hugo Terashima‐Marin, Peyman Najafirad, Mohd Anas Wajid

发表日期

2024/1

来源

Engineering Reports

卷号

期号

页码范围

e12785

出版商

John Wiley & Sons, Inc.

简介

Generating an image/video caption has always been a fundamental problem of Artificial Intelligence, which is usually performed using the potential of Deep Learning Methods, Computer Vision, Knowledge Graphs, and Natural Language Processing (NLP). The significant task of image/video captioning is to describe visual content in terms of natural language. Due to a semantic gap, this presents a massive problem in understanding and explaining images or videos syntactically and semantically. The current systems need somewhere to fill the gap between low‐level and high‐level features while mapping. Therefore, to tackle this problem, there is a need to describe the latest research and methods to overcome difficulties and to propose effective solutions. This work thoroughly analyses and investigates the most related methods (deep learning and knowledge graph‐based approaches), benchmark datasets, and …

引用总数

被引用次数：10

202320242 7

学术搜索中的文章

Deep learning and knowledge graph for image/video captioning: A review of datasets, evaluation metrics, and methods

MS Wajid, H Terashima‐Marin, P Najafirad, MA Wajid - Engineering Reports, 2024

被引用次数：10 相关文章所有 2 个版本