deep reinforcement high level policies - Academic Resource Search

Deep reinforcement learning: An overview

Y Li - arXiv preprint arXiv:1701.07274, 2017 - arxiv.org

… , function approximation, policy optimization, deep RL, RL … To have a good understanding
of deep reinforcement learning, … a top level action value function and a lower level action …

Cited by 1805 Related articles All 6 versions

[PDF] ucl.ac.uk

Deep reinforcement learning: A brief survey

K Arulkumaran, MP Deisenroth… - IEEE Signal …, 2017 - ieeexplore.ieee.org

… ), policies could also run other policies (multitime-step “actions”) [79]. This approach allows
top-level policies to focus on higher-level … by using one top-level policy that chooses between …

Cited by 3278 Related articles All 6 versions

[PDF] arxiv.org

Using deep reinforcement learning to learn high-level policies on the ATRIAS biped

T Li, H Geyer, CG Atkeson, A Rai - … International Conference on …, 2019 - ieeexplore.ieee.org

… In this work, we used deep reinforcement learning to learn two neural network policies to …
One of the policies uses a general neural network, while the second builds on the structure …

Cited by 56 Related articles All 5 versions

[PDF] arxiv.org

A brief survey of deep reinforcement learning

K Arulkumaran, MP Deisenroth, M Brundage… - arXiv preprint arXiv …, 2017 - arxiv.org

… with a higher-level understanding of the visual world. Currently, … and policy-based methods.
Our survey will cover central algorithms in deep reinforcement learning, including the deep Q-…

Cited by 1011 Related articles All 12 versions

[PDF] neurips.cc

Language as an abstraction for hierarchical deep reinforcement learning

Y Jiang, SS Gu, KP Murphy… - Advances in Neural …, 2019 - proceedings.neurips.cc

… , particularly when combined with existing reinforcement learning algorithms. We explore
how we might incorporate a language model into the high level policy in Appendix A, which …

Cited by 227 Related articles All 9 versions

[PDF] neurips.cc

Edge: Explaining deep reinforcement learning policies

W Guo, X Wu, U Khan, X Xing - Advances in Neural …, 2021 - proceedings.neurips.cc

… At a high level, our method identifies the important time steps by approximating the target
agent’s decision-making process with a self-explainable model and extracting the explanations …

Cited by 59 Related articles All 7 versions

Multi-level policy and reward-based deep reinforcement learning framework for image captioning

N Xu, H Zhang, AA Liu, W Nie, Y Su… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

… multi-level policy and reward reinforcement learning framework … -level policy network aims
to jointly update the word- and sentence-level policies for word generation, and the multi-level …

Cited by 100 Related articles

[PDF] github.io

Human-level control through deep reinforcement learning

V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness… - Nature, 2015 - nature.com

… high-dimensional data (colour video at 60 Hz) as input—to demonstrate that our approach
robustly learns successful policies … to break through to the top level of bricks and the value …

Cited by 30583 Related articles All 57 versions

DeepLoco: Dynamic locomotion skills using hierarchical deep reinforcement learning

XB Peng, G Berseth, KK Yin… - ACM Transactions on …, 2017 - dl.acm.org

… deep reinforcement learning (RL) to learn control policies at both timescales. The use of deep
… of objective functions for low-level and high-level policies. Taken together, the hierarchical …

Cited by 640 Related articles All 3 versions

[PDF] arxiv.org

Adversarial policies: Attacking deep reinforcement learning

A Gleave, M Dennis, C Wild, N Kant, S Levine… - arXiv preprint arXiv …, 2019 - arxiv.org

… policies are more successful in high-dimensional environments, and induce substantially
different activations in the victim policy … Additionally, we find policies are easier to attack in high-…

Cited by 407 Related articles All 10 versions
