Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

J Yang, K Zhou, Y Li, Z Liu - International Journal of Computer Vision, 2024 - Springer

Abstract Out-of-distribution (OOD) detection is critical to ensuring the reliability and safety of
machine learning systems. For instance, in autonomous driving, we would like the driving …

Cited by 978 · Related articles · All 4 versions

[PDF] arxiv.org

Generalized out-of-distribution detection and beyond in vision language model era: A survey

A Miyai, J Yang, J Zhang, Y Ming, Y Lin, Q Yu… - arXiv preprint arXiv …, 2024 - arxiv.org

Detecting out-of-distribution (OOD) samples is crucial for ensuring the safety of machine
learning systems and has shaped the field of OOD detection. Meanwhile, several other …

Cited by 10 · Related articles · All 4 versions

[PDF] arxiv.org

Is a picture worth a thousand words? Delving into spatial reasoning for vision language models

J Wang, Y Ming, Z Shi, V Vineet, X Wang, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs) and vision-language models (VLMs) have demonstrated
remarkable performance across a wide range of tasks and domains. Despite this promise …

Cited by 14 · Related articles · All 5 versions

[PDF] allenpress.com

Introduction to Generative Artificial Intelligence: Contextualizing the Future

R Singh, JY Kim, EF Glassy… - … of pathology & …, 2025 - meridian.allenpress.com

Context.—Generative artificial intelligence (GAI) is a promising new technology with the
potential to transform communication and workflows in health care and pathology. Although …

Cited by 1 · Related articles · All 3 versions

[PDF] arxiv.org

Large language models for anomaly and out-of-distribution detection: A survey

R Xu, K Ding - arXiv preprint arXiv:2409.01980, 2024 - arxiv.org

Detecting anomalies or out-of-distribution (OOD) samples is critical for maintaining the
reliability and trustworthiness of machine learning systems. Recently, Large Language …

Cited by 2 · Related articles · All 2 versions

[PDF] arxiv.org

MMR: Evaluating reading ability of large multimodal models

J Chen, R Zhang, Y Zhou, R Rossi, J Gu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large multimodal models (LMMs) have demonstrated impressive capabilities in
understanding various types of image, including text-rich images. Most existing text-rich …

Cited by 2 · Related articles · All 3 versions

[PDF] arxiv.org

SkinGEN: An explainable dermatology diagnosis-to-generation framework with interactive vision-language models

B Lin, Y Xu, X Bao, Z Zhao, Z Zhang, Z Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

With the continuous advancement of vision language models (VLMs) technology,
remarkable research achievements have emerged in the dermatology field, the fourth most …

Cited by 2 · Related articles · All 2 versions

[PDF] allenpress.com
