Language Models as Knowledge Bases for Visual Word Sense Disambiguation

3 results (0.02 seconds)

[PDF] springer.com

A survey on knowledge-enhanced multimodal learning

M Lymperaiou, G Stamou - Artificial Intelligence Review, 2024 - Springer

Multimodal learning has been a field of increasing interest, aiming to combine various
modalities in a single joint representation. Especially in the area of visiolinguistic (VL) …

Cited by 12 · Related articles · All 4 versions

[PDF] arxiv.org

Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation

A Kritharoula, M Lymperaiou, G Stamou - arXiv preprint arXiv:2310.14025, 2023 - arxiv.org

Visual Word Sense Disambiguation (VWSD) is a novel challenging task with the goal of
retrieving an image among a set of candidates, which better represents the meaning of an …

Cited by 9 · Related articles · All 5 versions

[PDF] ntua.gr

[PDF] Αυτόματη παραγωγή εικόνων μόδας με χρήση προτροπής σε γενετικά μοντέλα μηχανικής μάθησης (Automatic generation of fashion images using prompting in generative machine learning models)

Γ Αργυρού - 2024 - dspace.lib.ntua.gr

Abstract: In the contemporary fashion landscape, the convergence of technology and creativity has
created new opportunities and redirected the industry's standards. In the …
