Dual global enhanced transformer for image captioning

T Xian, Z Li, C Zhang, H Ma - Neural Networks, 2022 - Elsevier
Transformer-based architectures have shown great success in image captioning, where self-
attention module can model source and target interaction (eg, object-to-object, object-to …

Adaptive path selection for dynamic image captioning

T Xian, Z Li, Z Tang, H Ma - … on Circuits and Systems for Video …, 2022 - ieeexplore.ieee.org
Image captioning is a challenging task, ie, given an image machine automatically generates
natural language that matches its semantic content and has attracted much attention in …

Image caption generation using contextual information fusion with Bi-LSTM-s

H Zhang, C Ma, Z Jiang, J Lian - IEEE Access, 2022 - ieeexplore.ieee.org
The image caption generation algorithm necessitates the expression of image content using
accurate natural language. Given the existing encoder-decoder algorithm structure, the …

Matching images and texts with multi-head attention network for cross-media hashing retrieval

Z Li, X Xie, F Ling, H Ma, Z Shi - Engineering Applications of Artificial …, 2021 - Elsevier
The cross-media hashing retrieval generally encodes multimedia data into a common binary
hash space, which can effectively measure the correlation between samples from different …

Improving image captioning with Pyramid Attention and SC-GAN

T Chen, Z Li, J Wu, H Ma, B Su - Image and Vision Computing, 2022 - Elsevier
Most of the existing image captioning models mainly use global attention, which represents
the whole image features, local attention, representing the object features, or a combination …

Enhance understanding and reasoning ability for image captioning

J Wei, Z Li, J Zhu, H Ma - Applied Intelligence, 2023 - Springer
Image captioning aims to generate a grammatically correct and semantically accurate
natural language description of a given image. To better capture the complex information …

Image captioning using transformer-based double attention network

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Engineering Applications of …, 2023 - Elsevier
Image captioning generates a human-like description for a query image, which has attracted
considerable attention recently. The most broadly utilized model for image description is an …

Transformer approaches in image captioning: a literature review

H Tsaniya, C Fatichah, N Suciati - 2022 14th International …, 2022 - ieeexplore.ieee.org
Image captioning is one of the challenging tasks that cross the computer vision and the
Natural Language Processing (NLP) domain. Its main task is to interpret images in a …

Image captioning with novel topics guidance and retrieval-based topics re-weighting

M Al-Qatf, X Wang, A Hawbani… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Topic modelling (TM) has shown significant progress in boosting the effectiveness of image
captioning in the last few years. Although important improvements have been shown in …

ICEAP: An advanced fine-grained image captioning network with enhanced attribute predictor

MB Hossen, Z Ye, A Abdussalam, MA Hossain - Displays, 2024 - Elsevier
Fine-grained image captioning is a focal point in the vision-to-language task and has
attracted considerable attention for generating accurate and contextually relevant image …