Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning

P Sharma, N Ding, S Goodman… - Proceedings of the 56th …, 2018 - aclanthology.org

Proceedings of the 56th Annual Meeting of the Association for …, 2018 • aclanthology.org

Abstract

We present a new dataset of image caption annotations, Conceptual Captions, which contains an order of magnitude more images than the MS-COCO dataset (Lin et al., 2014) and represents a wider variety of both images and image caption styles. We achieve this by extracting and filtering image caption annotations from billions of webpages. We also present quantitative evaluations of a number of image captioning models and show that a model architecture based on Inception-ResNetv2 (Szegedy et al., 2016) for image-feature extraction and Transformer (Vaswani et al., 2017) for sequence modeling achieves the best performance when trained on the Conceptual Captions dataset.

Cited by 2197 · All 3 versions
