Constructing tree-based index for efficient and effective dense retrieval

H Li, Q Ai, J Zhan, J Mao, Y Liu, Z Liu… - Proceedings of the 46th …, 2023 - dl.acm.org
Recent studies have shown that Dense Retrieval (DR) techniques can significantly improve
the performance of first-stage retrieval in IR systems. Despite its empirical effectiveness, the …

Recent Approaches and Trends in Approximate Nearest Neighbor Search, with Remarks on Benchmarking.

M Aumüller, M Ceccarello - IEEE Data Eng. Bull., 2023 - sites.computer.org
Nearest neighbor search is a computational primitive whose efficiency is paramount to many
applications. As such, the literature recently blossomed with many works focusing on …

Taxonomy-guided Semantic Indexing for Academic Paper Search

SK Kang, Y Zhang, P Jiang, D Lee, J Han… - arXiv preprint arXiv …, 2024 - arxiv.org
Academic paper search is an essential task for efficient literature discovery and scientific
advancement. While dense retrieval has advanced various ad-hoc searches, it often …

Reverse maximum inner product search: Formulation, algorithms, and analysis

D Amagata, T Hara - ACM Transactions on the Web, 2023 - dl.acm.org
The maximum inner product search (MIPS), which finds the item with the highest inner
product with a given query user, is an essential problem in the recommendation field …

Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection

H Shah, K Mittal, A Rajwade - European Conference on Computer Vision, 2025 - Springer
This work presents an adaptive group testing framework for the range-based high
dimensional near neighbor search problem. Our method efficiently marks each item in a …

Arkade: k-Nearest Neighbor Search With Non-Euclidean Distances using GPU Ray Tracing

DK Mandarapu, V Nagarajan, A Pelenitsyn… - Proceedings of the 38th …, 2024 - dl.acm.org
High-performance implementations of k-Nearest Neighbor Search (kNN) in low dimensions
use tree-based data structures. Tree algorithms are hard to parallelize on GPUs due to their …

Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

K Lu, C Xiao, Y Ishikawa - arXiv preprint arXiv:2402.11354, 2024 - arxiv.org
Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal
challenge in the field of machine learning. In recent years, graph-based methods have …

A Simple Linear Space Data Structure for ANN with Application in Differential Privacy

M Aumüller, F Boninsegna, F Silvestri - arXiv preprint arXiv:2409.07187, 2024 - arxiv.org
Locality Sensitive Filters are known for offering a quasi-linear space data structure with
rigorous guarantees for the Approximate Near Neighbor search problem. Building on …

Scalable Density-based Clustering with Random Projections

H Xu, N Pham - arXiv preprint arXiv:2402.15679, 2024 - arxiv.org
We present sDBSCAN, a scalable density-based clustering algorithm in high dimensions
with cosine distance. Utilizing the neighborhood-preserving property of random projections …

Group Testing for Accurate and Efficient Range-Based Near Neighbor Search: An Adaptive Binary Splitting Approach

K Mittal, H Shah, A Rajwade - arXiv preprint arXiv:2311.02573, 2023 - arxiv.org
This work presents an adaptive group testing framework for the range-based high
dimensional near neighbor search problem. The proposed method detects high-similarity …