Learning to collocate neural modules for image captioning

X Yang, H Zhang, J Cai - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
… to dynamically collocate neural modules on-the-fly during captioning. … of neural modules
for visual reasoning [58, 3]. However, we believe that a combination of simple neural modules

Image captioning with compositional neural module networks

J Tian, J Oh - arXiv preprint arXiv:2007.05608, 2020 - arxiv.org
… in an input image. Inspired by the idea of the compositional neural module networks in the
vi… introduce a hierarchical framework for image captioning that explores both compositionality …

A modularized architecture of multi-branch convolutional neural network for image captioning

S He, Y Lu - Electronics, 2019 - mdpi.com
… text according to the input image. In this paper, we … neural network (CNN) as the encoder
and recurrent neural network (RNN) as the decoder. In order to get better image captioning

Learning to collocate visual-linguistic neural modules for image captioning

X Yang, H Zhang, C Gao, J Cai - International Journal of Computer Vision, 2023 - Springer
… of collocating visual-linguistic modules is more challenging. … collocate the modules during
the process of image captioning. To … : (1) distinguishable module design—four modules in the …

Neural attention for image captioning: review of outstanding methods

Z Zohourianshahzadi, JK Kalita - Artificial Intelligence Review, 2022 - Springer
… for image captioning. Instead of offering a comprehensive review of all prior work on deep
image captioning … mechanisms used for the task of image captioning in deep learning models. …

A neural compositional paradigm for image captioning

B Dai, S Fidler, D Lin - Advances in Neural Information …, 2018 - proceedings.neurips.cc
… in the generated captions, and … for image captioning, which factorizes the captioning
procedure into two stages: (1) extracting an explicit semantic representation from the given image; …

Learning cooperative neural modules for stylized image captioning

X Wu, W Zhao, J Luo - International Journal of Computer Vision, 2022 - Springer
… With this in mind, we propose a novel stylized image captioning approach … neural modules
under the reinforcement learning paradigm. A low-level neural module called syntax module

Attention correctness in neural image captioning

C Liu, J Mao, F Sha, A Yuille - Proceedings of the AAAI conference on …, 2017 - ojs.aaai.org
… In this work we focus on attention models for image captioning. The state-of-the-art … of image
captioning, we evaluated the state-of-the-art models with implicitly trained attention modules

Hierarchical attention network for image captioning

W Wang, Z Chen, H Hu - Proceedings of the AAAI conference on …, 2019 - ojs.aaai.org
… Given an image I, the image captioning model needs to generate a caption sequence w =
{w1,w2,...,wT }, … t of the visual LSTM, we apply a neural network to normalize attention weights: …

AutoCaption: Image captioning with neural architecture search

X Zhu, W Wang, L Guo, J Liu - arXiv preprint arXiv:2012.09742, 2020 - arxiv.org
… Our goal here is to optimize the text generation module of the image captioning model. We
used the RNN architecture as our text generation module and our goal here is to optimize the …