alt text automatic image captioning- 学术资源搜索

Injecting semantic concepts into end-to-end image captioning

Z Fang, J Wang, X Hu, L Liang, Z Gan… - Proceedings of the …, 2022 - openaccess.thecvf.com

… tags play an important role in improving the captioning performance. Instead of gleaning the
object tags … novel fully VIsion Transformer based image CAPtioning model, dubbed ViTCAP, …

被引用次数：101 相关文章所有 9 个版本

[PDF] arxiv.org

Iconographic image captioning for artworks

E Cetinic - … ICPR International Workshops and Challenges: Virtual …, 2021 - Springer

… not structured primarily as an image captioning dataset, each … on the down-stream task of
image captioning [42]. Transformer-… of artwork images with the goal to generate image captions …

被引用次数：25 相关文章所有 9 个版本

[PDF] arxiv.org

Entity-Aware Multimodal Alignment Framework for News Image Captioning

J Zhang, H Zhang, X Wan - arXiv preprint arXiv:2402.19404, 2024 - arxiv.org

… Therefore, we attempt to create a specialized image-text matching task within the news
image captioning task to align vision features with entity-aware textual features. Furthermore, …

被引用次数：1 相关文章所有 2 个版本

[PDF] thecvf.com

X-linear attention networks for image captioning

Y Pan, T Yao, Y Li, T Mei - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com

… In the standard task of image captioning, we are given an image I to be described with a
natural-language sentence Y1:T . The sentence Y1:T = 1w1, w2, ..., wT l is a sequence of T …

被引用次数：629 相关文章所有 8 个版本

[PDF] arxiv.org

Cnn+ cnn: Convolutional decoders for image captioning

Q Wang, AB Chan - arXiv preprint arXiv:1805.09019, 2018 - arxiv.org

… Inspired by the applications of CNNs in the field of NLP, we develop a framework that only
employ CNNs for image captioning. The main contributions of this paper are: 1. We propose a …

被引用次数：110 相关文章所有 4 个版本

[PDF] github.io

Designing tools for high-quality alt text authoring

K Mack, E Cutrell, B Lee, MR Morris - Proceedings of the 23rd …, 2021 - dl.acm.org

… Recently, researchers created “Twitter A11y,” a browser extension that implements six
methods of adding alt text to images without captions on Twitter [10]. Each of these features …

被引用次数：46 相关文章所有 7 个版本

Image captioning with adaptive incremental global context attention

C Wang, X Gu - Applied Intelligence, 2022 - Springer

… of subsequent words when generating a sentence for image captioning task [9, 10]. … for
image captioning, which makes full use of the global information of the generated target captions …

被引用次数：21 相关文章所有 3 个版本

[PDF] acm.org

Utilizing a Dense Video Captioning Technique for Generating Image Descriptions of Comics for People with Visual Impairments

S Kim, S Lee, K Kim, U Oh - … of the 29th International Conference on …, 2024 - dl.acm.org

… Automatic Alt Text) system that utilizes computer vision technology to identify faces, objects,
and themes in images … results generated by DCC and the image captioning model, DCC was …

被引用次数：1 相关文章所有 2 个版本

Image captioning with novel topics guidance and retrieval-based topics re-weighting

M Al-Qatf, X Wang, A Hawbani… - IEEE Transactions …, 2022 - ieeexplore.ieee.org

… throughout the whole captioning task without considering the … captioning network to focus
on inaccurate image objects. To tackle these challenges, we propose a novel image captioning …

被引用次数：13 相关文章所有 2 个版本

[PDF] 124.222.48.233

Adaptive text denoising network for image caption editing

M Yuan, BK Bao, Z Tan, C Xu - ACM Transactions on Multimedia …, 2023 - dl.acm.org

… We extensively evaluate our proposals on the MS-COCO image captioning dataset and
prove the effectiveness of our method when compared with the state-of-the-arts. …

被引用次数：5 相关文章所有 3 个版本

高级搜索

QQ 群

Injecting semantic concepts into end-to-end image captioning

Iconographic image captioning for artworks

Entity-Aware Multimodal Alignment Framework for News Image Captioning

X-linear attention networks for image captioning

Cnn+ cnn: Convolutional decoders for image captioning

Designing tools for high-quality alt text authoring

Image captioning with adaptive incremental global context attention

Utilizing a Dense Video Captioning Technique for Generating Image Descriptions of Comics for People with Visual Impairments

Image captioning with novel topics guidance and retrieval-based topics re-weighting

Adaptive text denoising network for image caption editing

引用