Image captioning with word gate and adaptive self-critical learning

J Li, P Yao, L Guo, W Zhang - Applied Sciences, 2019 - mdpi.com

Image captioning attempts to generate a description given an image, usually taking
Convolutional Neural Network as the encoder to extract the visual features and a sequence …

被引用次数：42 相关文章所有 6 个版本

DAA: Dual LSTMs with adaptive attention for image captioning

F Xiao, X Gong, Y Zhang, Y Shen, J Li, X Gao - Neurocomputing, 2019 - Elsevier

Image captioning enables people to better understand images through fine-grained
analysis. Recently the encoder-decoder architecture with attention mechanism has achieved …

被引用次数：40 相关文章所有 2 个版本

[PDF] arxiv.org

AutoCaption: Image captioning with neural architecture search

X Zhu, W Wang, L Guo, J Liu - arXiv preprint arXiv:2012.09742, 2020 - arxiv.org

Image captioning transforms complex visual information into abstract natural language for
representation, which can help computers understanding the world quickly. However, due to …

被引用次数：19 相关文章所有 2 个版本

[PDF] arxiv.org

Learning to Answer Multilingual and Code-Mixed Questions

D Gupta - arXiv preprint arXiv:2211.07522, 2022 - arxiv.org

Question-answering (QA) that comes naturally to humans is a critical component in
seamless human-computer interaction. It has emerged as one of the most convenient and …

Analysis on Semantic categorization technique of analogous label by using Deep Learning Technique

RS Shyam, B Sriman, R Pavithra, N Rao… - … and Smart Electrical …, 2022 - ieeexplore.ieee.org

In current era, by the arrival mobile Internet, the quantity of statistics and available data in the
system is increasing drastically, particularly moderately structured and partially structured …

[PDF] hit.edu.cn

[PDF][PDF] 融合多标签和双注意力机制的图像语义理解模型

吴倩，应捷，黄影平，杨海马，胡文凯 - 智能计算机与应用, 2020 - cs.hit.edu.cn

针对现有图像语义理解模型存在描述不充分以及视觉属性冗余的问题, 提出了一种带有视觉三元
组标签且能够挖掘潜在信息的图像语义理解模型VT-BLSTM. 首先, 使用卷积神经网络提取图像 …

[引用][C] 基于改进的Transformer_decoder 的增强图像描述

林椹尠，屈嘉欣，罗亮 - 计算机与现代化, 2023

高级搜索

QQ 群