Catalyst. RL: a distributed framework for reproducible RL research

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc

Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

被引用次数：543 相关文章所有 6 个版本

[PDF] jmlr.org

Chainerrl: A deep reinforcement learning library

Y Fujita, P Nagarajan, T Kataoka, T Ishikawa - Journal of Machine Learning …, 2021 - jmlr.org

In this paper, we introduce ChainerRL, an open-source deep reinforcement learning (DRL)
library built using Python and the Chainer deep learning framework. ChainerRL implements …

被引用次数：130 相关文章所有 8 个版本

Deep reinforcement learning with dqn vs. ppo in vizdoom

A Zakharenkov, I Makarov - 2021 IEEE 21st International …, 2021 - ieeexplore.ieee.org

VizDoom is a flexible and easy-to-use 3D reinforcement learning research platform based
on the well-known Doom first-person shooter. The challenge is to create bots that compete …

被引用次数：12 相关文章所有 2 个版本

[PDF] core.ac.uk

Learn to move through a combination of policy gradient algorithms: Ddpg, d4pg, and td3

N Bach, A Melnik, M Schilling, T Korthals… - … , Optimization, and Data …, 2020 - Springer

Abstract Deep Reinforcement Learning has recently seen progress for continuous control
tasks, driven by yearly challenges such as the NeurIPS Competition Track. This work …

被引用次数：22 相关文章所有 5 个版本

[PDF] arxiv.org

Sim2real for peg-hole insertion with eye-in-hand camera

D Bogunowicz, A Rybnikov, K Vendidandi… - arXiv preprint arXiv …, 2020 - arxiv.org

Even though the peg-hole insertion is one of the well-studied problems in robotics, it still
remains a challenge for robots, especially when it comes to flexibility and the ability to …

被引用次数：8 相关文章所有 2 个版本

[PDF] arxiv.org

Project proposal: A modular reinforcement learning based automated theorem prover

B Shminke - arXiv preprint arXiv:2209.02562, 2022 - arxiv.org

We propose to build a reinforcement learning prover of independent components: a
deductive system (an environment), the proof state representation (how an agent sees the …

高级搜索

QQ 群