Related articles - Academic resource search

Attend to you: Personalized image captioning with context sequence memory networks

C Chunseong Park, B Kim… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We address personalization issues of image captioning, which have not been discussed yet
in previous research. For a query image, we aim to generate a descriptive sentence …

Cited by 212 · Related articles · All 10 versions

Towards personalized image captioning via multimodal memory networks

CC Park, B Kim, G Kim - IEEE transactions on pattern analysis …, 2018 - ieeexplore.ieee.org

We address personalized image captioning, which generates a descriptive sentence for a
user's image, accounting for prior knowledge such as her active vocabulary or writing style …

Cited by 53 · Related articles · All 6 versions

[PDF] thecvf.com

Recurrent fusion network for image captioning

W Jiang, L Ma, YG Jiang, W Liu… - Proceedings of the …, 2018 - openaccess.thecvf.com

Recently, much advance has been made in image captioning, and an encoder-decoder
framework has been adopted by all the state-of-the-art models. Under this framework, an …

Cited by 312 · Related articles · All 11 versions

[PDF] ijcai.org

[PDF] Show, Observe and Tell: Attribute-driven Attention Model for Image Captioning.

H Chen, G Ding, Z Lin, S Zhao, J Han - IJCAI, 2018 - ijcai.org

Attribute-based approaches and attention-based approaches have been proven to be
effective in image captioning. However, most attribute-based approaches simply predict …

Cited by 65 · Related articles · All 4 versions

[PDF] thecvf.com

Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning

L Chen, H Zhang, J Xiao, L Nie… - Proceedings of the …, 2017 - openaccess.thecvf.com

Visual attention has been successfully applied in structural prediction tasks such as visual
captioning and question answering. Existing visual attention models are generally spatial …

Cited by 2069 · Related articles · All 9 versions

[PDF] arxiv.org

Comic: Toward a compact image captioning model with attention

JH Tan, CS Chan, JH Chuah - IEEE Transactions on Multimedia, 2019 - ieeexplore.ieee.org

Recent works in image captioning have shown very promising raw performance. However,
we realize that most of these encoder-decoder style networks with attention do not scale …

Cited by 51 · Related articles · All 8 versions

[PDF] arxiv.org

Aligning linguistic words and visual semantic units for image captioning

L Guo, J Liu, J Tang, J Li, W Luo, H Lu - Proceedings of the 27th ACM …, 2019 - dl.acm.org

Image captioning attempts to generate a sentence composed of several linguistic words,
which are used to describe objects, attributes, and interactions in an image, denoted as …

Cited by 110 · Related articles · All 5 versions

Spatio-temporal memory attention for image captioning

J Ji, C Xu, X Zhang, B Wang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Visual attention has been successfully applied in image captioning to selectively incorporate
the most relevant areas to the language generation procedure. However, the attention in …

Cited by 65 · Related articles · All 4 versions

Transformer-based local-global guidance for image captioning

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Expert Systems with …, 2023 - Elsevier

Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …

Cited by 11 · Related articles · All 2 versions

[PDF] thecvf.com

Deep reinforcement learning-based image captioning with embedding reward

Z Ren, X Wang, N Zhang, X Lv… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

Image captioning is a challenging problem owing to the complexity in understanding the
image content and diverse ways of describing it in natural language. Recent advances in …

Cited by 410 · Related articles · All 13 versions
