In-context retrieval-augmented language models

O Ram, Y Levine, I Dalmedigos, D Muhlgay… - Transactions of the …, 2023 - direct.mit.edu
Abstract Retrieval-Augmented Language Modeling (RALM) methods, which condition a
language model (LM) on relevant documents from a grounding corpus during generation …

Augmented language models: a survey

G Mialon, R Dessì, M Lomeli, C Nalmpantis… - arXiv preprint arXiv …, 2023 - arxiv.org
This survey reviews works in which language models (LMs) are augmented with reasoning
skills and the ability to use tools. The former is defined as decomposing a potentially …

Enabling large language models to generate text with citations

T Gao, H Yen, J Yu, D Chen - arXiv preprint arXiv:2305.14627, 2023 - arxiv.org
Large language models (LLMs) have emerged as a widely-used tool for information
seeking, but their generated outputs are prone to hallucination. In this work, our aim is to …

Benchmarking large language models in retrieval-augmented generation

J Chen, H Lin, X Han, L Sun - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Retrieval-Augmented Generation (RAG) is a promising approach for mitigating the
hallucination of large language models (LLMs). However, existing research lacks rigorous …

Interleaving retrieval with chain-of-thought reasoning for knowledge-intensive multi-step questions

H Trivedi, N Balasubramanian, T Khot… - arXiv preprint arXiv …, 2022 - arxiv.org
Prompting-based large language models (LLMs) are surprisingly powerful at generating
natural language reasoning steps or Chains-of-Thoughts (CoT) for multi-step question …

Large language models for information retrieval: A survey

Y Zhu, H Yuan, S Wang, J Liu, W Liu, C Deng… - arXiv preprint arXiv …, 2023 - arxiv.org
As a primary means of information acquisition, information retrieval (IR) systems, such as
search engines, have integrated themselves into our daily lives. These systems also serve …

[HTML][HTML] Information retrieval meets large language models: a strategic report from chinese ir community

Q Ai, T Bai, Z Cao, Y Chang, J Chen, Z Chen, Z Cheng… - AI Open, 2023 - Elsevier
The research field of Information Retrieval (IR) has evolved significantly, expanding beyond
traditional search to meet diverse user information needs. Recently, Large Language …

Landmark attention: Random-access infinite context length for transformers

A Mohtashami, M Jaggi - arXiv preprint arXiv:2305.16300, 2023 - arxiv.org
While Transformers have shown remarkable success in natural language processing, their
attention mechanism's large memory requirements have limited their ability to handle longer …

When Large Language Models Meet Vector Databases: A Survey

Z Jing, Y Su, Y Han, B Yuan, C Liu, H Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
The recent burst in Large Language Models has opened new frontiers in human-like text
processing and generation. However, alongside their remarkable growth, Large Language …

Knowledge-augmented reasoning distillation for small language models in knowledge-intensive tasks

M Kang, S Lee, J Baek… - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract Large Language Models (LLMs) have shown promising performance in knowledge-
intensive reasoning tasks that require a compound understanding of knowledge. However …