A Survey on Image-text Multimodal Models

R Guo, J Wei, L Sun, B Yu, G Chang, D Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Amidst the evolving landscape of artificial intelligence, the convergence of visual and textual
information has surfaced as a crucial frontier, leading to the advent of image-text multimodal …

A survey on knowledge-enhanced multimodal learning

M Lymperaiou, G Stamou - Artificial Intelligence Review, 2024 - Springer
Multimodal learning has been a field of increasing interest, aiming to combine various
modalities in a single joint representation. Especially in the area of visiolinguistic (VL) …

Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation

A Kritharoula, M Lymperaiou, G Stamou - arXiv preprint arXiv:2310.14025, 2023 - arxiv.org
Visual Word Sense Disambiguation (VWSD) is a novel challenging task with the goal of
retrieving an image among a set of candidates, which better represents the meaning of an …

Language Models as Knowledge Bases for Visual Word Sense Disambiguation

A Kritharoula, M Lymperaiou, G Stamou - arXiv preprint arXiv:2310.01960, 2023 - arxiv.org
Visual Word Sense Disambiguation (VWSD) is a novel challenging task that lies between
linguistic sense disambiguation and fine-grained multimodal retrieval. The recent …

Machine Learning and Knowledge Graphs: Existing Gaps and Future Research Challenges

C d'Amato, L Mahon, P Monnin… - Transactions on Graph …, 2023 - inria.hal.science
The graph model is nowadays largely adopted to model a wide range of knowledge and
data, spanning from social networks to knowledge graphs (KGs), representing a successful …

[PDF][PDF] Αυτόματη παραγωγή εικόνων μόδας με χρήση προτροπής σε γενετικά μοντέλα μηχανικής μάθησης

Γ Αργυρού - 2024 - dspace.lib.ntua.gr
Περίληψη Στο σύγχρονο τοπίο της μόδας, η σύγκλιση τεχνολογίας και δημιουργικότητας έχει
δημιουργήσει νέες ευκαιρίες και αναδρομολογήσει τα πρότυπα της βιομηχανίας. Στο …