Attend to you: Personalized image captioning with context sequence memory networks

C Chunseong Park, B Kim… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We address personalization issues of image captioning, which have not been discussed yet
in previous research. For a query image, we aim to generate a descriptive sentence …

Towards personalized image captioning via multimodal memory networks

CC Park, B Kim, G Kim - IEEE transactions on pattern analysis …, 2018 - ieeexplore.ieee.org
We address personalized image captioning, which generates a descriptive sentence for a
user's image, accounting for prior knowledge such as her active vocabulary or writing style …

Recurrent fusion network for image captioning

W Jiang, L Ma, YG Jiang, W Liu… - Proceedings of the …, 2018 - openaccess.thecvf.com
Recently, much advance has been made in image captioning, and an encoder-decoder
framework has been adopted by all the state-of-the-art models. Under this framework, an …

[PDF][PDF] Show, Observe and Tell: Attribute-driven Attention Model for Image Captioning.

H Chen, G Ding, Z Lin, S Zhao, J Han - IJCAI, 2018 - ijcai.org
Attribute-based approaches and attention-based approaches have been proven to be
effective in image captioning. However, most attribute-based approaches simply predict …

Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning

L Chen, H Zhang, J Xiao, L Nie… - Proceedings of the …, 2017 - openaccess.thecvf.com
Visual attention has been successfully applied in structural prediction tasks such as visual
captioning and question answering. Existing visual attention models are generally spatial …

Comic: Toward a compact image captioning model with attention

JH Tan, CS Chan, JH Chuah - IEEE Transactions on Multimedia, 2019 - ieeexplore.ieee.org
Recent works in image captioning have shown very promising raw performance. However,
we realize that most of these encoder-decoder style networks with attention do not scale …

Aligning linguistic words and visual semantic units for image captioning

L Guo, J Liu, J Tang, J Li, W Luo, H Lu - Proceedings of the 27th ACM …, 2019 - dl.acm.org
Image captioning attempts to generate a sentence composed of several linguistic words,
which are used to describe objects, attributes, and interactions in an image, denoted as …

Spatio-temporal memory attention for image captioning

J Ji, C Xu, X Zhang, B Wang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Visual attention has been successfully applied in image captioning to selectively incorporate
the most relevant areas to the language generation procedure. However, the attention in …

Transformer-based local-global guidance for image captioning

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Expert Systems with …, 2023 - Elsevier
Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …

Deep reinforcement learning-based image captioning with embedding reward

Z Ren, X Wang, N Zhang, X Lv… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Image captioning is a challenging problem owing to the complexity in understanding the
image content and diverse ways of describing it in natural language. Recent advances in …