Generalized out-of-distribution detection: A survey

J Yang, K Zhou, Y Li, Z Liu - International Journal of Computer Vision, 2024 - Springer
Out-of-distribution (OOD) detection is critical to ensuring the reliability and safety of
machine learning systems. For instance, in autonomous driving, we would like the driving …

Generalized out-of-distribution detection and beyond in vision language model era: A survey

A Miyai, J Yang, J Zhang, Y Ming, Y Lin, Q Yu… - arXiv preprint arXiv …, 2024 - arxiv.org
Detecting out-of-distribution (OOD) samples is crucial for ensuring the safety of machine
learning systems and has shaped the field of OOD detection. Meanwhile, several other …

Is a picture worth a thousand words? Delving into spatial reasoning for vision language models

J Wang, Y Ming, Z Shi, V Vineet, X Wang, Y Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) and vision-language models (VLMs) have demonstrated
remarkable performance across a wide range of tasks and domains. Despite this promise …

Introduction to Generative Artificial Intelligence: Contextualizing the Future

R Singh, JY Kim, EF Glassy… - … of pathology & …, 2025 - meridian.allenpress.com
Context.—Generative artificial intelligence (GAI) is a promising new technology with the
potential to transform communication and workflows in health care and pathology. Although …

Large language models for anomaly and out-of-distribution detection: A survey

R Xu, K Ding - arXiv preprint arXiv:2409.01980, 2024 - arxiv.org
Detecting anomalies or out-of-distribution (OOD) samples is critical for maintaining the
reliability and trustworthiness of machine learning systems. Recently, Large Language …

MMR: Evaluating reading ability of large multimodal models

J Chen, R Zhang, Y Zhou, R Rossi, J Gu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large multimodal models (LMMs) have demonstrated impressive capabilities in
understanding various types of image, including text-rich images. Most existing text-rich …

SkinGEN: An explainable dermatology diagnosis-to-generation framework with interactive vision-language models

B Lin, Y Xu, X Bao, Z Zhao, Z Zhang, Z Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
With the continuous advancement of vision language models (VLMs) technology,
remarkable research achievements have emerged in the dermatology field, the fourth most …

Generative Artificial Intelligence in Anatomic Pathology

V Brodsky, E Ullah, A Bychkov… - … of Pathology & …, 2025 - meridian.allenpress.com
Context.—Generative artificial intelligence (AI) has emerged as a transformative force in
various fields, including anatomic pathology, where it offers the potential to significantly …

Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation

S Lee, J Kim, S Hwang - arXiv preprint arXiv:2410.14975, 2024 - arxiv.org
With the recent emergence of foundation models trained on internet-scale data and
demonstrating remarkable generalization capabilities, such foundation models have …

JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation

S Onohara, A Miyai, Y Imajuku, K Egashira… - arXiv preprint arXiv …, 2024 - arxiv.org
Accelerating research on Large Multimodal Models (LMMs) in non-English languages is
crucial for enhancing user experiences across broader populations. In this paper, we …