Deep reinforcement learning with plasticity injection

E Nikishin, J Oh, G Ostrovski, C Lyle… - Advances in …, 2024 - proceedings.neurips.cc
A growing body of evidence suggests that neural networks employed in deep reinforcement
learning (RL) gradually lose their plasticity, the ability to learn from new data; however, the …

[图书][B] Foundations of deep reinforcement learning: theory and practice in Python

L Graesser, WL Keng - 2019 - books.google.com
Deep reinforcement learning (deep RL) combines deep learning and reinforcement
learning, in which artificial agents learn to solve sequential decision-making problems. In the …

[图书][B] Deep learning in science

P Baldi - 2021 - books.google.com
This is the first rigorous, self-contained treatment of the theory of deep learning. Starting with
the foundations of the theory and building it up, this is essential reading for any scientists …

Drivegan: Towards a controllable high-quality neural simulation

SW Kim, J Philion, A Torralba… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Realistic simulators are critical for training and verifying robotics systems. While most of the
contemporary simulators are hand-crafted, a scaleable way to build simulators is to use …

High fidelity video prediction with large stochastic recurrent neural networks

R Villegas, A Pathak, H Kannan… - Advances in …, 2019 - proceedings.neurips.cc
Predicting future video frames is extremely challenging, as there are many factors of
variation that make up the dynamics of how frames change through time. Previously …

Simulation-based reinforcement learning for real-world autonomous driving

B Osiński, A Jakubowski, P Zięcina… - … on robotics and …, 2020 - ieeexplore.ieee.org
We use reinforcement learning in simulation to obtain a driving system controlling a full-size
real-world vehicle. The driving policy takes RGB images from a single camera and their …

Muesli: Combining improvements in policy optimization

M Hessel, I Danihelka, F Viola, A Guez… - International …, 2021 - proceedings.mlr.press
We propose a novel policy update that combines regularized policy optimization with model
learning as an auxiliary loss. The update (henceforth Muesli) matches MuZero's state-of-the …

Exploring model-based planning with policy networks

T Wang, J Ba - arXiv preprint arXiv:1906.08649, 2019 - arxiv.org
Model-based reinforcement learning (MBRL) with model-predictive control or online
planning has shown great potential for locomotion control tasks in terms of both sample …

Neuroevolution of self-interpretable agents

Y Tang, D Nguyen, D Ha - Proceedings of the 2020 Genetic and …, 2020 - dl.acm.org
Inattentional blindness is the psychological phenomenon that causes one to miss things in
plain sight. It is a consequence of the selective attention in perception that lets us remain …

Fast task inference with variational intrinsic successor features

S Hansen, W Dabney, A Barreto… - arXiv preprint arXiv …, 2019 - arxiv.org
It has been established that diverse behaviors spanning the controllable subspace of an
Markov decision process can be trained by rewarding a policy for being distinguishable from …