Model-based reinforcement learning for atari

K Hippalgaonkar, Q Li, X Wang, JW Fisher III… - Nature Reviews …, 2023 - nature.com

As materials researchers increasingly embrace machine-learning (ML) methods, it is natural
to wonder what lessons can be learned from other fields undergoing similar developments …

被引用次数：57 相关文章所有 3 个版本

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

被引用次数：1738 相关文章所有 3 个版本

[PDF] arxiv.org

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv:2301.04104, 2023 - arxiv.org

General intelligence requires solving tasks across many domains. Current reinforcement
learning algorithms carry this potential but are held back by the resources and knowledge …

被引用次数：296 相关文章所有 3 个版本

[PDF] arxiv.org

Planning with diffusion for flexible behavior synthesis

M Janner, Y Du, JB Tenenbaum, S Levine - arXiv preprint arXiv …, 2022 - arxiv.org

Model-based reinforcement learning methods often use learning only for the purpose of
estimating an approximate dynamics model, offloading the rest of the decision-making work …

被引用次数：340 相关文章所有 5 个版本

[PDF] neurips.cc

Deep reinforcement learning at the edge of the statistical precipice

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

被引用次数：546 相关文章所有 6 个版本

[PDF] arxiv.org

A cookbook of self-supervised learning

R Balestriero, M Ibrahim, V Sobal, A Morcos… - arXiv preprint arXiv …, 2023 - arxiv.org

Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to
advance machine learning. Yet, much like cooking, training SSL methods is a delicate art …

被引用次数：204 相关文章所有 5 个版本

[PDF] neurips.cc

Flexible diffusion modeling of long videos

W Harvey, S Naderiparizi, V Masrani… - Advances in …, 2022 - proceedings.neurips.cc

We present a framework for video modeling based on denoising diffusion probabilistic
models that produces long-duration video completions in a variety of realistic environments …

被引用次数：181 相关文章所有 8 个版本

[PDF] mlr.press

Bigger, better, faster: Human-level atari with human-level efficiency

M Schwarzer, JSO Ceron, A Courville… - International …, 2023 - proceedings.mlr.press

We introduce a value-based RL agent, which we call BBF, that achieves super-human
performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used …

被引用次数：45 相关文章所有 7 个版本

[PDF] arxiv.org

Mastering atari with discrete world models

D Hafner, T Lillicrap, M Norouzi, J Ba - arXiv preprint arXiv:2010.02193, 2020 - arxiv.org

Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …

被引用次数：719 相关文章所有 8 个版本

[HTML] sciltp.com

[HTML][HTML] Reinforcement learning for disassembly system optimization problems: A survey

X Guo, Z Bi, J Wang, S Qin, S Liu, L Qi - International Journal of Network …, 2023 - sciltp.com

The disassembly complexity of end-of-life products increases continuously. Traditional
methods are facing difficulties in solving the decision-making and control problems of …

被引用次数：41 相关文章所有 2 个版本

高级搜索

QQ 群