Deja vu: Contextual sparsity for efficient LLMs at inference time

Z Liu, J Wang, T Dao, T Zhou, B Yuan… - International …, 2023 - proceedings.mlr.press
Large language models (LLMs) with hundreds of billions of parameters have sparked a new
wave of exciting AI applications. However, they are computationally expensive at inference …

Image super-resolution with non-local sparse attention

Y Mei, Y Fan, Y Zhou - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Both non-local (NL) operation and sparse representation are crucial for Single Image Super-
Resolution (SISR). In this paper, we investigate their combinations and propose a novel Non …

Reformer: The efficient transformer

N Kitaev, Ł Kaiser, A Levskaya - arXiv preprint arXiv:2001.04451, 2020 - arxiv.org
Large Transformer models routinely achieve state-of-the-art results on a number of tasks but
training these models can be prohibitively costly, especially on long sequences. We …

The limitations of federated learning in sybil settings

C Fung, CJM Yoon, I Beschastnikh - 23rd International Symposium on …, 2020 - usenix.org
Federated learning over distributed multi-party data is an emerging paradigm that iteratively
aggregates updates from a group of devices to train a globally shared model. Relying on a …

ETC: Encoding long and structured inputs in transformers

J Ainslie, S Ontanon, C Alberti, V Cvicek… - arXiv preprint arXiv …, 2020 - arxiv.org
Transformer models have advanced the state of the art in many Natural Language
Processing (NLP) tasks. In this paper, we present a new Transformer architecture, Extended …

Accelerating large-scale inference with anisotropic vector quantization

R Guo, P Sun, E Lindgren, Q Geng… - International …, 2020 - proceedings.mlr.press
Quantization based techniques are the current state-of-the-art for scaling maximum inner
product search to massive databases. Traditional approaches to quantization aim to …

Mitigating sybils in federated learning poisoning

C Fung, CJM Yoon, I Beschastnikh - arXiv preprint arXiv:1808.04866, 2018 - arxiv.org
Machine learning (ML) over distributed multi-party data is required for a variety of domains.
Existing approaches, such as federated learning, collect the outputs computed by a group of …

Deep k-nearest neighbors: Towards confident, interpretable and robust deep learning

N Papernot, P McDaniel - arXiv preprint arXiv:1803.04765, 2018 - arxiv.org
Deep neural networks (DNNs) enable innovative applications of machine learning like
image recognition, machine translation, or malware detection. However, deep learning is …

Scatterbrain: Unifying sparse and low-rank attention

B Chen, T Dao, E Winsor, Z Song… - Advances in Neural …, 2021 - proceedings.neurips.cc
Recent advances in efficient Transformers have exploited either the sparsity or low-rank
properties of attention matrices to reduce the computational and memory bottlenecks of …

Survey of vector database management systems

JJ Pan, J Wang, G Li - The VLDB Journal, 2024 - Springer
There are now over 20 commercial vector database management systems (VDBMSs), all
produced within the past five years. But embedding-based retrieval has been studied for …