Deep reinforcement learning at the edge of the statistical precipice

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc
Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

Chainerrl: A deep reinforcement learning library

Y Fujita, P Nagarajan, T Kataoka, T Ishikawa - Journal of Machine Learning …, 2021 - jmlr.org
In this paper, we introduce ChainerRL, an open-source deep reinforcement learning (DRL)
library built using Python and the Chainer deep learning framework. ChainerRL implements …

Deep reinforcement learning with dqn vs. ppo in vizdoom

A Zakharenkov, I Makarov - 2021 IEEE 21st International …, 2021 - ieeexplore.ieee.org
VizDoom is a flexible and easy-to-use 3D reinforcement learning research platform based
on the well-known Doom first-person shooter. The challenge is to create bots that compete …

Learn to move through a combination of policy gradient algorithms: Ddpg, d4pg, and td3

N Bach, A Melnik, M Schilling, T Korthals… - … , Optimization, and Data …, 2020 - Springer
Abstract Deep Reinforcement Learning has recently seen progress for continuous control
tasks, driven by yearly challenges such as the NeurIPS Competition Track. This work …

Sim2real for peg-hole insertion with eye-in-hand camera

D Bogunowicz, A Rybnikov, K Vendidandi… - arXiv preprint arXiv …, 2020 - arxiv.org
Even though the peg-hole insertion is one of the well-studied problems in robotics, it still
remains a challenge for robots, especially when it comes to flexibility and the ability to …

Project proposal: A modular reinforcement learning based automated theorem prover

B Shminke - arXiv preprint arXiv:2209.02562, 2022 - arxiv.org
We propose to build a reinforcement learning prover of independent components: a
deductive system (an environment), the proof state representation (how an agent sees the …