A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor search

M Wang, X Xu, Q Yue, Y Wang - arXiv preprint arXiv:2101.12631, 2021 - arxiv.org
Approximate nearest neighbor search (ANNS) constitutes an important operation in a
multitude of applications, including recommendation systems, information retrieval, and …

Milvus: A purpose-built vector data management system

J Wang, X Yi, R Guo, H Jin, P Xu, S Li, X Wang… - Proceedings of the …, 2021 - dl.acm.org
Recently, there has been a pressing need to manage high-dimensional vector data in data
science and AI applications. This trend is fueled by the proliferation of unstructured data and …

Personal LLM agents: Insights and survey about the capability, efficiency and security

Y Li, H Wen, W Wang, X Li, Y Yuan, G Liu, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …

Distributed hierarchical GPU parameter server for massive scale deep learning ads systems

W Zhao, D Xie, R Jia, Y Qian, R Ding… - … of Machine Learning …, 2020 - proceedings.mlsys.org
Neural networks of ads systems usually take input from multiple resources, e.g., query-ad
relevance, ad features and user portraits. These inputs are encoded into one-hot or multi-hot …

Identifying surface-enhanced Raman spectra with a Raman library using machine learning

Y Ju, O Neumann, M Bajomo, Y Zhao, P Nordlander… - ACS …, 2023 - ACS Publications
Since its discovery, surface-enhanced Raman spectroscopy (SERS) has shown outstanding
promise of identifying trace amounts of unknown molecules in rapid, portable formats …

AIBox: CTR prediction model training on a single node

W Zhao, J Zhang, D Xie, Y Qian, R Jia, P Li - Proceedings of the 28th …, 2019 - dl.acm.org
As one of the major search engines in the world, Baidu's Sponsored Search has long
adopted the use of deep neural network (DNN) models for Ads click-through rate (CTR) …

MOBIUS: towards the next generation of query-ad matching in Baidu's sponsored search

M Fan, J Guo, S Zhu, S Miao, M Sun, P Li - Proceedings of the 25th ACM …, 2019 - dl.acm.org
Baidu runs the largest commercial web search engine in China, serving hundreds of millions
of online users every day in response to a great variety of queries. In order to build a high …

CXL-ANNS: Software-Hardware collaborative memory disaggregation and computation for Billion-Scale approximate nearest neighbor search

J Jang, H Choi, H Bae, S Lee, M Kwon… - 2023 USENIX Annual …, 2023 - usenix.org
We propose CXL-ANNS, a software-hardware collaborative approach to enable highly
scalable approximate nearest neighbor search (ANNS) services. To this end, we first …

CAGRA: Highly parallel graph construction and approximate nearest neighbor search for GPUs

H Ootomo, A Naruse, C Nolet, R Wang… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines
spanning data mining and artificial intelligence, from information retrieval and computer …

Manu: a cloud native vector database management system

R Guo, X Luan, L Xiang, X Yan, X Yi, J Luo… - arXiv preprint arXiv …, 2022 - arxiv.org
With the development of learning-based embedding models, embedding vectors are widely
used for analyzing and searching unstructured data. As vector collections exceed billion …