From matching to generation: A survey on generative information retrieval

X Li, J Jin, Y Zhou, Y Zhang, P Zhang, Y Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Information Retrieval (IR) systems are crucial tools for users to access information, widely
applied in scenarios like search engines, question answering, and recommendation …

Dolomites: Domain-Specific Long-Form Methodical Tasks

C Malaviya, P Agrawal, K Ganchev… - Transactions of the …, 2025 - direct.mit.edu
Experts in various fields routinely perform methodical writing tasks to plan, organize, and
report their work. From a clinician writing a differential diagnosis for a patient, to a teacher …

Defining knowledge: Bridging epistemology and large language models

C Fierro, R Dhar, F Stamatiou, N Garneau… - arXiv preprint arXiv …, 2024 - arxiv.org
Knowledge claims are abundant in the literature on large language models (LLMs); but can
we say that GPT-4 truly "knows" the Earth is round? To address this question, we review …

Citekit: A modular toolkit for large language model citation generation

J Shen, T Zhou, S Zhao, Y Chen, K Liu - arXiv preprint arXiv:2408.04662, 2024 - arxiv.org
Enabling Large Language Models (LLMs) to generate citations in Question-Answering (QA)
tasks is an emerging paradigm aimed at enhancing the verifiability of their responses when …

TruthReader: Towards Trustworthy Document Assistant Chatbot with Reliable Attribution

D Li, X Hu, Z Sun, B Hu, S Ye, Z Shan… - Proceedings of the …, 2024 - aclanthology.org
Document assistant chatbots are empowered with extensive capabilities by Large Language
Models (LLMs) and have exhibited significant advancements. However, these systems may …

ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models

J Yi, J Yin, J Xu, P Bao, Y Wang, W Fan… - arXiv preprint arXiv …, 2025 - arxiv.org
Vision-Language Models (VLMs) have demonstrated remarkable capabilities in
understanding multimodal inputs and have been widely integrated into Retrieval …

Scalable and Domain-General Abstractive Proposition Segmentation

MJ Hosseini, Y Gao, T Baumgärtner… - arXiv preprint arXiv …, 2024 - arxiv.org
Segmenting text into fine-grained units of meaning is important to a wide range of NLP
applications. The default approach of segmenting text into sentences is often insufficient …

Enhancing LLM's Reliability by Iterative Verification Attributions with Keyword Fronting

Y Sui, J Ren, H Tan, H Chen, Z Li, J Wang - Joint European Conference …, 2024 - Springer
Retrieval-augmented text generation attribution is of great significance for knowledge-
intensive tasks as it can enhance the credibility and verifiability of large language models …