- 学术资源搜索

Deep reinforcement learning with plasticity injection

E Nikishin, J Oh, G Ostrovski, C Lyle… - Advances in …, 2024 - proceedings.neurips.cc

A growing body of evidence suggests that neural networks employed in deep reinforcement
learning (RL) gradually lose their plasticity, the ability to learn from new data; however, the …

被引用次数：21 相关文章所有 6 个版本

[PDF] ua.es

[图书][B] Foundations of deep reinforcement learning: theory and practice in Python

L Graesser, WL Keng - 2019 - books.google.com

Deep reinforcement learning (deep RL) combines deep learning and reinforcement
learning, in which artificial agents learn to solve sequential decision-making problems. In the …

被引用次数：157 相关文章所有 6 个版本

[PDF] uci.edu

[图书][B] Deep learning in science

P Baldi - 2021 - books.google.com

This is the first rigorous, self-contained treatment of the theory of deep learning. Starting with
the foundations of the theory and building it up, this is essential reading for any scientists …

被引用次数：98 相关文章所有 9 个版本

[PDF] thecvf.com

Drivegan: Towards a controllable high-quality neural simulation

SW Kim, J Philion, A Torralba… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Realistic simulators are critical for training and verifying robotics systems. While most of the
contemporary simulators are hand-crafted, a scaleable way to build simulators is to use …

被引用次数：81 相关文章所有 6 个版本

[PDF] neurips.cc

High fidelity video prediction with large stochastic recurrent neural networks

R Villegas, A Pathak, H Kannan… - Advances in …, 2019 - proceedings.neurips.cc

Predicting future video frames is extremely challenging, as there are many factors of
variation that make up the dynamics of how frames change through time. Previously …

被引用次数：149 相关文章所有 10 个版本

[PDF] arxiv.org

Simulation-based reinforcement learning for real-world autonomous driving

B Osiński, A Jakubowski, P Zięcina… - … on robotics and …, 2020 - ieeexplore.ieee.org

We use reinforcement learning in simulation to obtain a driving system controlling a full-size
real-world vehicle. The driving policy takes RGB images from a single camera and their …

被引用次数：140 相关文章所有 11 个版本

[PDF] mlr.press

Muesli: Combining improvements in policy optimization

M Hessel, I Danihelka, F Viola, A Guez… - International …, 2021 - proceedings.mlr.press

We propose a novel policy update that combines regularized policy optimization with model
learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the …

被引用次数：77 相关文章所有 5 个版本

[PDF] arxiv.org

Exploring model-based planning with policy networks

T Wang, J Ba - arXiv preprint arXiv:1906.08649, 2019 - arxiv.org

Model-based reinforcement learning (MBRL) with model-predictive control or online
planning has shown great potential for locomotion control tasks in terms of both sample …

被引用次数：164 相关文章所有 4 个版本

[PDF] acm.org

Neuroevolution of self-interpretable agents

Y Tang, D Nguyen, D Ha - Proceedings of the 2020 Genetic and …, 2020 - dl.acm.org

Inattentional blindness is the psychological phenomenon that causes one to miss things in
plain sight. It is a consequence of the selective attention in perception that lets us remain …

被引用次数：128 相关文章所有 5 个版本

[PDF] arxiv.org

Fast task inference with variational intrinsic successor features

S Hansen, W Dabney, A Barreto… - arXiv preprint arXiv …, 2019 - arxiv.org

It has been established that diverse behaviors spanning the controllable subspace of an
Markov decision process can be trained by rewarding a policy for being distinguishable from …

被引用次数：159 相关文章所有 5 个版本

高级搜索

QQ 群

Deep reinforcement learning with plasticity injection

[图书][B] Foundations of deep reinforcement learning: theory and practice in Python

[图书][B] Deep learning in science

Drivegan: Towards a controllable high-quality neural simulation

High fidelity video prediction with large stochastic recurrent neural networks

Simulation-based reinforcement learning for real-world autonomous driving

Muesli: Combining improvements in policy optimization

Exploring model-based planning with policy networks

Neuroevolution of self-interpretable agents

Fast task inference with variational intrinsic successor features

引用