相关文章- 学术资源搜索

Spatio-temporal memory attention for image captioning

J Ji, C Xu, X Zhang, B Wang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Visual attention has been successfully applied in image captioning to selectively incorporate
the most relevant areas to the language generation procedure. However, the attention in …

被引用次数：68 相关文章所有 4 个版本

[PDF] ijcai.org

[PDF][PDF] Show, Observe and Tell: Attribute-driven Attention Model for Image Captioning.

H Chen, G Ding, Z Lin, S Zhao, J Han - IJCAI, 2018 - ijcai.org

Attribute-based approaches and attention-based approaches have been proven to be
effective in image captioning. However, most attribute-based approaches simply predict …

被引用次数：66 相关文章所有 4 个版本

[PDF] thecvf.com

Look back and predict forward in image captioning

Y Qin, J Du, Y Zhang, H Lu - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com

Most existing attention-based methods on image captioning focus on the current word and
visual information in one time step and generate the next word, without considering the …

被引用次数：145 相关文章所有 4 个版本

[PDF] arxiv.org

Aligning linguistic words and visual semantic units for image captioning

L Guo, J Liu, J Tang, J Li, W Luo, H Lu - Proceedings of the 27th ACM …, 2019 - dl.acm.org

Image captioning attempts to generate a sentence composed of several linguistic words,
which are used to describe objects, attributes, and interactions in an image, denoted as …

被引用次数：116 相关文章所有 5 个版本

[PDF] arxiv.org

Dual attention on pyramid feature maps for image captioning

L Yu, J Zhang, Q Wu - IEEE Transactions on Multimedia, 2021 - ieeexplore.ieee.org

Generating natural sentences from images is a fundamental learning task for visual-
semantic understanding in multimedia. In this paper, we propose to apply dual attention on …

被引用次数：46 相关文章所有 4 个版本

Task-adaptive attention for image captioning

C Yan, Y Hao, L Li, J Yin, A Liu, Z Mao… - … on Circuits and …, 2021 - ieeexplore.ieee.org

Attention mechanisms are now widely used in image captioning models. However, most
attention models only focus on visual features. When generating syntax related words, little …

被引用次数：251 相关文章所有 2 个版本

A new attention-based LSTM for image captioning

F Xiao, W Xue, Y Shen, X Gao - Neural Processing Letters, 2022 - Springer

Image captioning aims to describe the content of an image with a complete and natural
sentence. Recently, the image captioning methods with encoder-decoder architecture has …

被引用次数：29 相关文章所有 3 个版本

Divergent-convergent attention for image captioning

J Ji, Z Du, X Zhang - Pattern Recognition, 2021 - Elsevier

Attention mechanism has made great progress in image captioning, where semantic words
or local regions are selectively embedded into the language model. However, current …

被引用次数：34 相关文章所有 2 个版本

The synergy of double attention: Combine sentence-level and word-level attention for image captioning

H Wei, Z Li, C Zhang, H Ma - Computer Vision and Image Understanding, 2020 - Elsevier

The existing attention models of image captioning typically extract only word-level attention
information, ie, the attention mechanism extracts local attention information from the image …

被引用次数：34 相关文章

Bi-directional co-attention network for image captioning

W Jiang, W Wang, H Hu - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org

Image Captioning, which automatically describes an image with natural language, is
regarded as a fundamental challenge in computer vision. In recent years, significant …

被引用次数：35 相关文章

高级搜索

QQ 群