Fine-grained image captioning with global-local discriminative objective

J Wu, T Chen, H Wu, Z Yang, G Luo… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
… Similarly, the discriminative captions can also improve the accuracy of the caption. It can be
… tive to optimize the image captioning model, which encourages generating fine-grained and …

High-quality image captioning with fine-grained and semantic-guided visual attention

Z Zhang, Q Wu, Y Wang, F Chen - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
… In this paper, we propose a novel image captioning model with fine-grained and semantic-guided
visual attention based on a novel Fully Convolutional Network (FCN)-LSTM framework…

EMScore: Evaluating video captioning via coarse-grained and fine-grained embedding matching

Y Shi, X Yang, H Xu, C Yuan, B Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
image captioning. In this paper, we propose an evaluation metric specifically for video captioning
… We consider not only coarse-grained embedding matching between video and text but …

Fine-grained image captioning with CLIP reward

J Cho, S Yoon, A Kale, F Dernoncourt, T Bui… - arXiv preprint arXiv …, 2022 - arxiv.org
… A More Image Captioning Examples We provide more image captioning examples using
different reward functions in Table 5. Overall, the captions from the model with CLIP-S+…

REO-relevance, extraness, omission: A fine-grained evaluation for image captioning

M Jiang, J Hu, Q Huang, L Zhang, J Diesner… - arXiv preprint arXiv …, 2019 - arxiv.org
fine-grained, error-aware evaluation method REO to measure the quality of machine-generated
image captions … ground truth, extra description beyond image content, and omitted …

Context-aware visual policy network for fine-grained image captioning

ZJ Zha, D Liu, H Zhang, Y Zhang… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
… the proposed fine-grained image captioning framework. We first formulate the image
caption… 3.1 Overview We formulate the task of image captioning into a sequential decision-making …

A thorough review of models, evaluation metrics, and datasets on image captioning

G Luo, L Cheng, C Jing, C Zhao… - IET Image Processing, 2022 - Wiley Online Library
… Furthermore, we review the primary datasets used to explore image captions, from
domain-generic benchmarks to domain-specific datasets collected to investigate specific aspects of …

c-RNN: a fine-grained language model for image captioning

G Huang, H Hu - Neural Processing Letters, 2019 - Springer
… a c-RNN image captioning model, which as far as we know, is the first one to model image
captions in character level. Compared with existing works on image captioning, there are two …

Fine-Grained Features for Image Captioning

M Shao, J Feng, J Wu, H Zhang… - Computers, Materials & …, 2023 - cdn.techscience.cn
… used to extract image features in image captioning, and the … and do not pay attention to
fine-grained details because of the … properly generates captions by fusing fine-grained features …

Integration of textual cues for fine-grained image captioning using deep CNN and LSTM

N Gupta, AS Jalal - Neural Computing and Applications, 2020 - Springer
… recurrent neural networks for image captioning. However, in … in an image can give more
fine-grained captioning of a … of image captioning by fusing text feature available in an image