Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

Survey of vector database management systems

JJ Pan, J Wang, G Li - The VLDB Journal, 2024 - Springer
There are now over 20 commercial vector database management systems (VDBMSs), all
produced within the past five years. But embedding-based retrieval has been studied for …

A neural corpus indexer for document retrieval

Y Wang, Y Hou, H Wang, Z Miao… - Advances in …, 2022 - proceedings.neurips.cc
Current state-of-the-art document retrieval solutions mainly follow an index-retrieve
paradigm, where the index is hard to be directly optimized for the final retrieval target. In this …

Data distillation: A survey

N Sachdeva, J McAuley - arXiv preprint arXiv:2301.04272, 2023 - arxiv.org
The popularity of deep learning has led to the curation of a vast number of massive and
multifarious datasets. Despite having close-to-human performance on individual tasks …

When large language models meet vector databases: A survey

Z Jing, Y Su, Y Han, B Yuan, H Xu, C Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
This survey explores the synergistic potential of Large Language Models (LLMs) and Vector
Databases (VecDBs), a burgeoning but rapidly evolving research area. With the proliferation …

Personal llm agents: Insights and survey about the capability, efficiency and security

Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …

Model-enhanced vector index

H Zhang, Y Wang, Q Chen, R Chang… - Advances in …, 2024 - proceedings.neurips.cc
Embedding-based retrieval methods construct vector indices to search for document
representations that are most similar to the query representations. They are widely used in …

Retrieval-augmented generation for ai-generated content: A survey

P Zhao, H Zhang, Q Yu, Z Wang, Y Geng, F Fu… - arXiv preprint arXiv …, 2024 - arxiv.org
The development of Artificial Intelligence Generated Content (AIGC) has been facilitated by
advancements in model algorithms, scalable foundation model architectures, and the …

Worst-case performance of popular approximate nearest neighbor search implementations: Guarantees and limitations

P Indyk, H Xu - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc
Graph-based approaches to nearest neighbor search are popular and powerful tools for
handling large datasets in practice, but they have limited theoretical guarantees. We study …

{CXL-ANNS}:{Software-Hardware} collaborative memory disaggregation and computation for {Billion-Scale} approximate nearest neighbor search

J Jang, H Choi, H Bae, S Lee, M Kwon… - 2023 USENIX Annual …, 2023 - usenix.org
We propose CXL-ANNS, a software-hardware collaborative approach to enable highly
scalable approximate nearest neighbor search (ANNS) services. To this end, we first …