In-context learning (ICL) is a form of prompting in which a transformer model operates on a sequence of (input, output) examples and performs inference on the fly. In this work, we …
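As a concrete illustration of this prompting setup, here is a minimal Python sketch of how (input, output) demonstrations are serialized into a few-shot prompt that the model then completes on the fly; the arithmetic task, the example pairs, and the formatting are illustrative assumptions, not details from the paper.

    # Minimal sketch: serialize (input, output) demonstrations into an
    # in-context-learning prompt. Task and formatting are assumptions.
    def build_icl_prompt(demonstrations, query):
        """Turn (input, output) pairs plus a query into a few-shot prompt."""
        blocks = [f"Input: {x}\nOutput: {y}" for x, y in demonstrations]
        blocks.append(f"Input: {query}\nOutput:")  # model fills in the answer
        return "\n\n".join(blocks)

    demos = [("2 + 3", "5"), ("7 + 1", "8")]
    print(build_icl_prompt(demos, "4 + 9"))  # the completion is the prediction

No weights are updated: the demonstrations steer the model purely through its context window.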
We observe that pre-trained large language models (LLMs) are capable of autoregressively completing complex token sequences, from arbitrary ones procedurally generated by …
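To make this concrete, the hedged sketch below procedurally generates a simple token sequence and splits it into the context an LLM would see and the continuation a successful pattern completer should produce; the generator and the pattern are invented for illustration (the paper draws its sequences from sources such as probabilistic grammars), and no model API is called.

    # Hedged sketch: a procedurally generated token sequence of the kind an
    # LLM could be asked to complete autoregressively. The pattern is a toy
    # assumption, not one of the paper's generators.
    import random

    def make_pattern(seed, n_items=5):
        """Emit pairs [k, k+1] whose k advances by a fixed random step."""
        rng = random.Random(seed)
        start, step = rng.randint(0, 9), rng.randint(1, 4)
        return [[start + i * step, start + i * step + 1] for i in range(n_items)]

    seq = make_pattern(seed=0)
    context = ", ".join(str(pair) for pair in seq[:-1]) + ", "
    print(context)   # what the model is shown
    print(seq[-1])   # what a pattern-completing model should emit next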
Large transformer models trained on diverse datasets have shown a remarkable ability to learn in-context, achieving high few-shot performance on tasks they were not explicitly …
Foundation models pretrained on diverse data at scale have demonstrated extraordinary capabilities in a wide range of vision and language tasks. When such models are deployed …
Transformers, originally devised for natural language processing (NLP), have also achieved significant success in computer vision (CV). Due to their strong expressive power …
Structured state space sequence (S4) models have recently achieved state-of-the-art performance on long-range sequence modeling tasks. These models also have fast …
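For orientation, here is a minimal numpy sketch of the discrete linear state space recurrence x_k = A x_{k-1} + B u_k, y_k = C x_k that underlies S4-style layers; the random matrices are placeholder assumptions, since real S4 layers use structured parameterizations (and an equivalent convolutional form) rather than dense random ones.

    # Sketch of the discrete state space recurrence behind S4-style models.
    # A, B, C are toy placeholders; S4 parameterizes them with structure.
    import numpy as np

    def ssm_scan(A, B, C, u):
        """Run x_k = A x_{k-1} + B u_k, y_k = C x_k over a scalar sequence u."""
        x = np.zeros(A.shape[0])
        ys = []
        for u_k in u:
            x = A @ x + B * u_k   # state update
            ys.append(C @ x)      # readout
        return np.array(ys)

    rng = np.random.default_rng(0)
    N = 4                          # state dimension
    A = 0.9 * np.eye(N)            # stable placeholder dynamics
    B, C = rng.standard_normal(N), rng.standard_normal(N)
    print(ssm_scan(A, B, C, u=rng.standard_normal(16)).shape)  # (16,)

During training the same linear recurrence can be unrolled into one long convolution, which is the source of these models' speed on long sequences.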
In-context learning is a surprising and important phenomenon that emerged when modern language models were scaled to billions of learned parameters. Without modifying a large …
It has long been established that predictive models can be transformed into lossless compressors and vice versa. Incidentally, in recent years, the machine learning community …
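The correspondence admits a one-line worked form: an ideal arithmetic coder spends -log2 p(token | context) bits per token, so a model's cumulative log-loss on a sequence is exactly the length of the lossless code it induces, and better prediction means shorter codes. The toy bigram model below is an assumption chosen for brevity, not a model from the paper.

    # Sketch of prediction-as-compression: total code length under an ideal
    # arithmetic coder equals the model's cumulative log-loss in bits.
    # The smoothed bigram "model" is a toy assumption.
    import math
    from collections import Counter

    text = "abababababab"
    pairs = Counter(zip(text, text[1:]))
    ctx = Counter(text[:-1])
    alphabet = sorted(set(text))

    def p(nxt, prev):
        """Laplace-smoothed estimate of p(next char | previous char)."""
        return (pairs[(prev, nxt)] + 1) / (ctx[prev] + len(alphabet))

    bits = sum(-math.log2(p(text[i + 1], text[i])) for i in range(len(text) - 1))
    print(f"ideal code length: {bits:.1f} bits vs {8 * len(text)} raw bits")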
Reinforcement learning has seen wide success in fine-tuning large language models to better align with instructions via human feedback. The so-called Reinforcement …
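For context, RLHF is commonly framed as maximizing a KL-regularized objective, J = E[r(x, y)] - beta * KL(pi || pi_ref): it rewards completions the reward model scores highly while penalizing drift from the supervised reference policy. The sketch below estimates that objective from toy numbers; the sample values and beta are placeholder assumptions, and real systems optimize it with sampled completions and PPO-style updates rather than a fixed list.

    # Hedged sketch of the KL-regularized RLHF objective estimated from
    # samples drawn from the current policy. All numbers are toy placeholders.
    def rlhf_objective(samples, beta=0.1):
        """samples: (reward, logp_policy, logp_reference) per completion."""
        total = 0.0
        for reward, logp_pi, logp_ref in samples:
            kl_est = logp_pi - logp_ref       # single-sample KL estimate
            total += reward - beta * kl_est   # reward minus divergence penalty
        return total / len(samples)

    toy = [(1.2, -3.1, -3.5), (0.4, -2.0, -2.1), (0.9, -4.2, -4.0)]
    print(f"objective estimate: {rlhf_objective(toy):.3f}")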