- Academic resource search

Efficient memory management for large language model serving with PagedAttention

W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng… - Proceedings of the 29th …, 2023 - dl.acm.org

High throughput serving of large language models (LLMs) requires batching sufficiently
many requests at a time. However, existing systems struggle because the key-value cache …

Cited by 453 Related articles All 4 versions

[PDF] thecvf.com

4D Gaussian splatting for real-time dynamic scene rendering

G Wu, T Yi, J Fang, L Xie, X Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Representing and rendering dynamic scenes has been an important but challenging task.
Especially to accurately model complex motions, high efficiency is usually hard to guarantee …

Cited by 185 Related articles All 3 versions

[PDF] thecvf.com

ViperGPT: Visual inference via Python execution for reasoning

D Surís, S Menon, C Vondrick - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Answering visual queries is a complex task that requires both visual processing and
reasoning. End-to-end models, the dominant approach for this task, do not explicitly …

Cited by 258 Related articles All 6 versions

[PDF] neurips.cc

Objaverse-XL: A universe of 10M+ 3D objects

M Deitke, R Liu, M Wallingford, H Ngo… - Advances in …, 2024 - proceedings.neurips.cc

Natural language processing and 2D vision models have attained remarkable proficiency on
many tasks primarily by escalating the scale of training data. However, 3D vision tasks have …

Cited by 148 Related articles All 6 versions

[PDF] arxiv.org

eDiff-I: Text-to-image diffusion models with an ensemble of expert denoisers

Y Balaji, S Nah, X Huang, A Vahdat, J Song… - arXiv preprint arXiv …, 2022 - arxiv.org

Large-scale diffusion-based generative models have led to breakthroughs in text-
conditioned high-resolution image synthesis. Starting from random noise, such text-to-image …

Cited by 506 Related articles All 2 versions

[HTML] acm.org

Nerfstudio: A modular framework for neural radiance field development

M Tancik, E Weber, E Ng, R Li, B Yi, T Wang… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org

Neural Radiance Fields (NeRF) are a rapidly growing area of research with wide-ranging
applications in computer vision, graphics, robotics, and more. In order to streamline the …

Cited by 316 Related articles All 3 versions

[HTML] nih.gov

Robust deep learning–based protein sequence design using ProteinMPNN

J Dauparas, I Anishchenko, N Bennett, H Bai… - Science, 2022 - science.org

Although deep learning has revolutionized protein structure prediction, almost all
experimentally characterized de novo protein designs have been generated using …

Cited by 644 Related articles All 13 versions

[PDF] neurips.cc

FlashAttention: Fast and memory-efficient exact attention with IO-awareness

T Dao, D Fu, S Ermon, A Rudra… - Advances in Neural …, 2022 - proceedings.neurips.cc

Transformers are slow and memory-hungry on long sequences, since the time and memory
complexity of self-attention are quadratic in sequence length. Approximate attention …

Cited by 944 Related articles All 10 versions

[PDF] arxiv.org

[PDF] TimesNet: Temporal 2D-variation modeling for general time series analysis

H Wu, T Hu, Y Liu, H Zhou, J Wang, M Long - arXiv preprint arXiv …, 2022 - arxiv.org

Time series analysis is of immense importance in extensive applications, such as weather
forecasting, anomaly detection, and action recognition. This paper focuses on temporal …

Cited by 389 Related articles All 5 versions

[PDF] neurips.cc

Large language models are zero-shot reasoners

T Kojima, SS Gu, M Reid, Y Matsuo… - Advances in neural …, 2022 - proceedings.neurips.cc

Pretrained large language models (LLMs) are widely used in many sub-fields of natural
language processing (NLP) and generally known as excellent few-shot learners with task …

Cited by 2351 Related articles All 11 versions
