Toward Automatic Relevance Judgment using Vision-Language Models for Image-Text Retrieval Evaluation

JH Yang, J Lin - arXiv preprint arXiv:2408.01363, 2024 - arxiv.org
Vision-Language Models (VLMs) have demonstrated success across diverse applications,
yet their potential to assist in relevance judgments remains uncertain. This paper assesses …

How to Engage your Readers? Generating Guiding Questions to Promote Active Reading

P Cui, V Zouhar, X Zhang, M Sachan - arXiv preprint arXiv:2407.14309, 2024 - arxiv.org
Using questions in written text is an effective strategy to enhance readability. However, what
makes an active reading question good, what the linguistic role of these questions is, and …

Images Speak Volumes: User-Centric Assessment of Image Generation for Accessible Communication

M Anschütz, T Sylaj, G Groh - arXiv preprint arXiv:2410.03430, 2024 - arxiv.org
Explanatory images play a pivotal role in accessible and easy-to-read (E2R) texts. However,
the images available in online databases are not tailored toward the respective texts, and …

Harmonizing Assistance: Moderating Visual and Textual Aids in AI-Enhanced Textbook Reading with IRead

X Zhang, V Dörig, P Cui, V Zouhar, T Netland… - 2024 - researchsquare.com
Textbooks continue to be one of the primary mediums of learning. Students often need
additional support during the process of reading textbooks leading to several research …

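Análise comparativa entre redes neurais convolucionais e o ChatGPT-4 em termos de desempenho, custo e tempo de processamento na classificação de imagens [Comparative analysis of convolutional neural networks and ChatGPT-4 in terms of performance, cost, and processing time in image classification]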

ALA dos Santos, DC Rosa, ÁACC Sobrinho… - Revista …, 2025 - periodicos.ifpb.edu.br
This study presents a comparison between the results obtained from Convolutional
Neural Networks (CNNs) and ChatGPT-4 in the classification of …