Are emergent abilities of large language models a mirage?

R Schaeffer, B Miranda… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent work claims that large language models display emergent abilities, abilities
not present in smaller-scale models that are present in larger-scale models. What makes …

Broken neural scaling laws

E Caballero, K Gupta, I Rish, D Krueger - arXiv preprint arXiv:2210.14891, 2022 - arxiv.org
We present a smoothly broken power law functional form (that we refer to as a Broken
Neural Scaling Law (BNSL)) that accurately models & extrapolates the scaling behaviors of …

Revisiting the minimalist approach to offline reinforcement learning

D Tarasov, V Kurenkov, A Nikulin… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent years have witnessed significant advancements in offline reinforcement learning
(RL), resulting in the development of numerous algorithms with varying degrees of …

Improving multimodal interactive agents with reinforcement learning from human feedback

J Abramson, A Ahuja, F Carnevale, P Georgiev… - arXiv preprint arXiv …, 2022 - arxiv.org
An important goal in artificial intelligence is to create agents that can both interact naturally
with humans and learn from their feedback. Here we demonstrate how to use reinforcement …

Uncovering neural scaling laws in molecular representation learning

D Chen, Y Zhu, J Zhang, Y Du, Z Li… - Advances in …, 2024 - proceedings.neurips.cc
Molecular Representation Learning (MRL) has emerged as a powerful tool for drug
and materials discovery in a variety of tasks such as virtual screening and inverse design …

Beyond scale: the diversity coefficient as a data quality metric demonstrates llms are pre-trained on formally diverse data

A Lee, B Miranda, S Koyejo - arXiv preprint arXiv:2306.13840, 2023 - arxiv.org
Current trends to pre-train capable Large Language Models (LLMs) mostly focus on scaling
of model and dataset size. However, the quality of pre-training data is an important factor for …

Pretraining on the test set is all you need

R Schaeffer - arXiv preprint arXiv:2309.08632, 2023 - arxiv.org
Inspired by recent work demonstrating the promise of smaller Transformer-based language
models pretrained on carefully curated data, we supercharge such approaches by investing …

Offline actor-critic reinforcement learning scales to large models

JT Springenberg, A Abdolmaleki, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We show that offline actor-critic reinforcement learning can scale to large models, such as
transformers, and follows similar scaling laws as supervised learning. We find that offline …

Scaling laws for single-agent reinforcement learning

J Hilton, J Tang, J Schulman - arXiv preprint arXiv:2301.13442, 2023 - arxiv.org
Recent work has shown that, in generative modeling, cross-entropy loss improves smoothly
with model size and training compute, following a power law plus constant scaling law. One …

Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations

R Schaeffer, V Lecomte, DB Pai, A Carranza… - arXiv preprint arXiv …, 2024 - arxiv.org
Maximum Manifold Capacity Representations (MMCR) is a recent multi-view self-supervised
learning (MVSSL) method that matches or surpasses other leading MVSSL methods. MMCR …