基于视觉区域聚合与双向协作的端到端图像描述生成

宋井宽, 曾鹏鹏, 顾嘉扬, 朱晋宽, 高联丽 - 软件学报, 2022 - jos.org.cn
… Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image
A beam-search decoder for normalization of social media text with application to machine …