Boosted transformer for image captioning

J Li, P Yao, L Guo, W Zhang - Applied Sciences, 2019 - mdpi.com
Image captioning attempts to generate a description given an image, usually taking
Convolutional Neural Network as the encoder to extract the visual features and a sequence …

DAA: Dual LSTMs with adaptive attention for image captioning

F Xiao, X Gong, Y Zhang, Y Shen, J Li, X Gao - Neurocomputing, 2019 - Elsevier
Image captioning enables people to better understand images through fine-grained
analysis. Recently the encoder-decoder architecture with attention mechanism has achieved …

AutoCaption: Image captioning with neural architecture search

X Zhu, W Wang, L Guo, J Liu - arXiv preprint arXiv:2012.09742, 2020 - arxiv.org
Image captioning transforms complex visual information into abstract natural language for
representation, which can help computers understanding the world quickly. However, due to …

Learning to Answer Multilingual and Code-Mixed Questions

D Gupta - arXiv preprint arXiv:2211.07522, 2022 - arxiv.org
Question-answering (QA) that comes naturally to humans is a critical component in
seamless human-computer interaction. It has emerged as one of the most convenient and …

Analysis on Semantic categorization technique of analogous label by using Deep Learning Technique

RS Shyam, B Sriman, R Pavithra, N Rao… - … and Smart Electrical …, 2022 - ieeexplore.ieee.org
In current era, by the arrival mobile Internet, the quantity of statistics and available data in the
system is increasing drastically, particularly moderately structured and partially structured …

[PDF][PDF] 融合多标签和双注意力机制的图像语义理解模型

吴倩, 应捷, 黄影平, 杨海马, 胡文凯 - 智能计算机与应用, 2020 - cs.hit.edu.cn
针对现有图像语义理解模型存在描述不充分以及视觉属性冗余的问题, 提出了一种带有视觉三元
组标签且能够挖掘潜在信息的图像语义理解模型VT-BLSTM. 首先, 使用卷积神经网络提取图像 …

[引用][C] 基于改进的Transformer_decoder 的增强图像描述

林椹尠, 屈嘉欣, 罗亮 - 计算机与现代化, 2023