RWKV: Reinventing RNNs for the transformer era

B Peng, E Alcaide, Q Anthony, A Albalak… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have revolutionized almost all natural language processing (NLP) tasks but
suffer from memory and computational complexity that scales quadratically with sequence …
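The quadratic scaling named in the snippet comes from materializing the full attention score matrix. Below is a minimal NumPy sketch, with toy sizes and a made-up decay value that are assumptions rather than the paper's settings, contrasting that cost with a fixed-state recurrence of the general kind RWKV substitutes:

```python
import numpy as np

n, d = 1024, 64                                   # toy sequence length and head size (assumed)
q, k, v = (np.random.randn(n, d) for _ in range(3))

# Standard attention builds an (n x n) score matrix, so time and memory
# grow quadratically with the sequence length n.
scores = q @ k.T / np.sqrt(d)
scores[np.triu(np.ones((n, n), dtype=bool), k=1)] = -np.inf   # causal mask
weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)
attn_out = weights @ v                                        # (n, d)

# A recurrent alternative keeps a fixed-size running state with one update per
# step, so cost grows linearly in n (schematic only; the decay below is an
# assumption, not RWKV's actual parameterization).
decay = 0.95
num, den = np.zeros(d), np.zeros(d)
rec_out = np.empty((n, d))
for t in range(n):
    kt = np.exp(k[t] - k[t].max())       # positive key weights, kept numerically tame
    num = decay * num + kt * v[t]
    den = decay * den + kt
    rec_out[t] = num / (den + 1e-8)
```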

Planning with diffusion for flexible behavior synthesis

M Janner, Y Du, JB Tenenbaum, S Levine - arXiv preprint arXiv …, 2022 - arxiv.org
Model-based reinforcement learning methods often use learning only for the purpose of
estimating an approximate dynamics model, offloading the rest of the decision-making work …

scGPT: toward building a foundation model for single-cell multi-omics using generative AI

H Cui, C Wang, H Maan, K Pang, F Luo, N Duan… - Nature …, 2024 - nature.com
Generative pretrained models have achieved remarkable success in various domains such
as language and computer vision. Specifically, the combination of large-scale diverse …

Training a helpful and harmless assistant with reinforcement learning from human feedback

Y Bai, A Jones, K Ndousse, A Askell, A Chen… - arXiv preprint arXiv …, 2022 - arxiv.org
We apply preference modeling and reinforcement learning from human feedback (RLHF) to
finetune language models to act as helpful and harmless assistants. We find this alignment …
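Preference modeling, as mentioned in the snippet, is commonly trained with a pairwise ranking loss over a preferred and a rejected response. A minimal sketch of that standard Bradley-Terry objective follows; the reward values are invented toy numbers, and this is not claimed to be the authors' implementation:

```python
import numpy as np

def preference_loss(r_chosen: np.ndarray, r_rejected: np.ndarray) -> float:
    """Pairwise preference (reward-model) objective:
    -log sigmoid(r_chosen - r_rejected), written stably with logaddexp."""
    return float(np.mean(np.logaddexp(0.0, -(r_chosen - r_rejected))))

# Toy scalar rewards a hypothetical reward model assigns to preferred vs. rejected replies.
print(preference_loss(np.array([1.2, 0.3]), np.array([0.4, 0.5])))
```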

Vision GNN: An image is worth graph of nodes

K Han, Y Wang, J Guo, Y Tang… - Advances in neural …, 2022 - proceedings.neurips.cc
Network architecture plays a key role in the deep learning-based computer vision system.
The widely-used convolutional neural network and transformer treat the image as a grid or …
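The "graph of nodes" in the title refers to treating image patches as graph vertices rather than a grid or sequence. A toy sketch of one way to build such a patch graph is below; the patch count, feature width, and neighbor count are assumptions, not the paper's configuration:

```python
import numpy as np

def knn_patch_graph(patch_feats: np.ndarray, k: int = 9) -> np.ndarray:
    """Treat each image patch as a node and connect it to its k nearest
    neighbors in feature space; returns neighbor indices of shape (num_patches, k)."""
    d2 = ((patch_feats[:, None, :] - patch_feats[None, :, :]) ** 2).sum(-1)
    np.fill_diagonal(d2, np.inf)                 # exclude self-edges
    return np.argsort(d2, axis=1)[:, :k]

feats = np.random.randn(14 * 14, 192)            # e.g. a 14x14 patch grid, 192-dim features
neighbors = knn_patch_graph(feats)               # edges over which graph layers aggregate
```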

Non-stationary transformers: Exploring the stationarity in time series forecasting

Y Liu, H Wu, J Wang, M Long - Advances in Neural …, 2022 - proceedings.neurips.cc
Transformers have shown great power in time series forecasting due to their global-range
modeling ability. However, their performance can degenerate terribly on non-stationary real …
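A common remedy for the non-stationarity the snippet points to is normalizing each input series before the model and restoring its statistics afterwards. The sketch below shows only that generic idea and is not claimed to be the paper's exact stationarization module; the series shapes are toy assumptions:

```python
import numpy as np

def stationarize(series: np.ndarray):
    """Remove each series' mean and scale so the model sees a more stationary input."""
    mu = series.mean(axis=-1, keepdims=True)
    sigma = series.std(axis=-1, keepdims=True) + 1e-8
    return (series - mu) / sigma, (mu, sigma)

def destationarize(forecast: np.ndarray, stats):
    """Put the removed statistics back onto the model's forecast."""
    mu, sigma = stats
    return forecast * sigma + mu

x = np.cumsum(np.random.randn(4, 96), axis=-1)   # 4 toy non-stationary series of length 96
x_norm, stats = stationarize(x)
y = destationarize(x_norm[:, -24:], stats)       # stand-in for a model's 24-step output
```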

TALLRec: An effective and efficient tuning framework to align large language model with recommendation

K Bao, J Zhang, Y Zhang, W Wang, F Feng… - Proceedings of the 17th …, 2023 - dl.acm.org
Large Language Models (LLMs) have demonstrated remarkable performance across
diverse domains, thereby prompting researchers to explore their potential for use in …

VideoMAE: Masked autoencoders are data-efficient learners for self-supervised video pre-training

Z Tong, Y Song, J Wang… - Advances in neural …, 2022 - proceedings.neurips.cc
Pre-training video transformers on extra large-scale datasets is generally required to
achieve premier performance on relatively small datasets. In this paper, we show that video …
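The masked-autoencoder idea in the title hinges on hiding most spacetime patches and reconstructing them. A toy sketch of tube-style masking (one spatial mask repeated across frames) follows; the mask ratio, frame count, and patch grid are assumptions, not the paper's values:

```python
import numpy as np

def tube_mask(num_frames: int, patches_per_frame: int, mask_ratio: float = 0.9, seed: int = 0):
    """Sample one spatial mask and repeat it across frames ('tube' masking),
    hiding a large fraction of patches so only the rest is encoded."""
    rng = np.random.default_rng(seed)
    n_visible = int(round(patches_per_frame * (1.0 - mask_ratio)))
    visible = rng.choice(patches_per_frame, size=n_visible, replace=False)
    frame_mask = np.ones(patches_per_frame, dtype=bool)   # True = masked
    frame_mask[visible] = False
    return np.tile(frame_mask, (num_frames, 1))

mask = tube_mask(num_frames=8, patches_per_frame=14 * 14)  # toy clip of 8 frame groups
```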

EfficientViT: Memory efficient vision transformer with cascaded group attention

X Liu, H Peng, N Zheng, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Vision transformers have shown great success due to their high model capabilities.
However, their remarkable performance is accompanied by heavy computation costs, which …

TensoRF: Tensorial radiance fields

A Chen, Z Xu, A Geiger, J Yu, H Su - European conference on computer …, 2022 - Springer
We present TensoRF, a novel approach to model and reconstruct radiance fields. Unlike
NeRF that purely uses MLPs, we model the radiance field of a scene as a 4D tensor, which …
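The snippet's point is that the scene volume is stored as a factorized tensor rather than queried through MLPs alone. Below is a much-reduced sketch of that factorization idea using a CP-style rank decomposition of a 3D density grid; the rank and grid sizes are toy assumptions, and the paper's full model additionally uses a vector-matrix factorization and appearance features:

```python
import numpy as np

# Store a volumetric density field as a low-rank factorization rather than a dense
# 3D grid: each of R components is an outer product of three 1D vectors.
R, X, Y, Z = 8, 64, 64, 64
vx, vy, vz = np.random.randn(R, X), np.random.randn(R, Y), np.random.randn(R, Z)

def density_at(i: int, j: int, k: int) -> float:
    """Query one voxel by summing rank components without materializing the X*Y*Z grid."""
    return float(np.sum(vx[:, i] * vy[:, j] * vz[:, k]))

print(density_at(10, 20, 30))
```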