Retrieval-augmented generation for natural language processing: A survey

S Wu, Y Xiong, Y Cui, H Wu, C Chen, Y Yuan… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated great success in various fields,
benefiting from their huge amount of parameters that store knowledge. However, LLMs still …

Efficient methods for natural language processing: A survey

M Treviso, JU Lee, T Ji, B Aken, Q Cao… - Transactions of the …, 2023 - direct.mit.edu
Recent work in natural language processing (NLP) has yielded appealing results from
scaling model parameters and training data; however, using only scale to improve …

A survey on retrieval-augmented text generation

H Li, Y Su, D Cai, Y Wang, L Liu - arXiv preprint arXiv:2202.01110, 2022 - arxiv.org
Recently, retrieval-augmented text generation attracted increasing attention of the
computational linguistics community. Compared with conventional generation models …

A voyage on neural machine translation for Indic languages

SK Sheshadri, D Gupta, MR Costa-Jussà - Procedia Computer Science, 2023 - Elsevier
With the invention of deep learning concepts, Machine Translation (MT) migrated towards
Neural Machine Translation (NMT) architectures, eventually from Statistical Machine …

Goodtriever: Adaptive toxicity mitigation with retrieval-augmented models

L Pozzobon, B Ermis, P Lewis, S Hooker - arXiv preprint arXiv:2310.07589, 2023 - arxiv.org
Considerable effort has been dedicated to mitigating toxicity, but existing methods often
require drastic modifications to model parameters or the use of computationally intensive …

Efficient machine translation domain adaptation

PH Martins, Z Marinho, AFT Martins - arXiv preprint arXiv:2204.12608, 2022 - arxiv.org
Machine translation models struggle when translating out-of-domain text, which makes
domain adaptation a topic of critical importance. However, most domain adaptation methods …

Gnn-lm: Language modeling based on global contexts via gnn

Y Meng, S Zong, X Li, X Sun, T Zhang, F Wu… - arXiv preprint arXiv …, 2021 - arxiv.org
Inspired by the notion that``{\it to copy is easier than to memorize}``, in this work, we
introduce GNN-LM, which extends the vanilla neural language model (LM) by allowing to …

Chunk-based nearest neighbor machine translation

PH Martins, Z Marinho, AFT Martins - arXiv preprint arXiv:2205.12230, 2022 - arxiv.org
Semi-parametric models, which augment generation with retrieval, have led to impressive
results in language modeling and machine translation, due to their ability to retrieve fine …

Towards robust k-nearest-neighbor machine translation

H Jiang, Z Lu, F Meng, C Zhou, J Zhou… - arXiv preprint arXiv …, 2022 - arxiv.org
k-Nearest-Neighbor Machine Translation (kNN-MT) becomes an important research
direction of NMT in recent years. Its main idea is to retrieve useful key-value pairs from an …

NN-NER: Named Entity Recognition with Nearest Neighbor Search

S Wang, X Li, Y Meng, T Zhang, R Ouyang, J Li… - arXiv preprint arXiv …, 2022 - arxiv.org
Inspired by recent advances in retrieval augmented methods in NLP~\citep {
khandelwal2019generalization, khandelwal2020nearest, meng2021gnn}, in this paper, we …