- 学术资源搜索

Deep reinforcement learning: A brief survey

K Arulkumaran, MP Deisenroth… - IEEE Signal …, 2017 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) is poised to revolutionize the field of artificial intelligence
(AI) and represents a step toward building autonomous systems with a higher-level …

被引用次数：3320 相关文章所有 6 个版本

[PDF] arxiv.org

A brief survey of deep reinforcement learning

K Arulkumaran, MP Deisenroth, M Brundage… - arXiv preprint arXiv …, 2017 - arxiv.org

Deep reinforcement learning is poised to revolutionise the field of AI and represents a step
towards building autonomous systems with a higher level understanding of the visual world …

被引用次数：1018 相关文章所有 12 个版本

[PDF] arxiv.org

Mastering diverse domains through world models

D Hafner, J Pasukonis, J Ba, T Lillicrap - arXiv preprint arXiv:2301.04104, 2023 - arxiv.org

Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …

被引用次数：325 相关文章所有 2 个版本

[PDF] arxiv.org

Mastering atari with discrete world models

D Hafner, T Lillicrap, M Norouzi, J Ba - arXiv preprint arXiv:2010.02193, 2020 - arxiv.org

Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …

被引用次数：744 相关文章所有 7 个版本

[PDF] arxiv.org

Soft actor-critic algorithms and applications

T Haarnoja, A Zhou, K Hartikainen, G Tucker… - arXiv preprint arXiv …, 2018 - arxiv.org

Model-free deep reinforcement learning (RL) algorithms have been successfully applied to a
range of challenging sequential decision making and control tasks. However, these methods …

被引用次数：2617 相关文章所有 4 个版本

[PDF] nowpublishers.com

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

被引用次数：1726 相关文章所有 16 个版本

[PDF] arxiv.org

First return, then explore

A Ecoffet, J Huizinga, J Lehman, KO Stanley, J Clune - Nature, 2021 - nature.com

Reinforcement learning promises to solve complex sequential-decision problems
autonomously by specifying a high-level reward function only. However, reinforcement …

被引用次数：361 相关文章所有 10 个版本

[PDF] mlr.press

Impala: Scalable distributed deep-rl with importance weighted actor-learner architectures

L Espeholt, H Soyer, R Munos… - International …, 2018 - proceedings.mlr.press

In this work we aim to solve a large collection of tasks using a single reinforcement learning
agent with a single set of parameters. A key challenge is to handle the increased amount of …

被引用次数：1596 相关文章所有 8 个版本

[PDF] mlr.press

Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor

T Haarnoja, A Zhou, P Abbeel… - … conference on machine …, 2018 - proceedings.mlr.press

Abstract Model-free deep reinforcement learning (RL) algorithms have been demonstrated
on a range of challenging decision making and control tasks. However, these methods …

被引用次数：8514 相关文章所有 7 个版本

[PDF] mlr.press

Fully decentralized multi-agent reinforcement learning with networked agents

K Zhang, Z Yang, H Liu, T Zhang… - … conference on machine …, 2018 - proceedings.mlr.press

We consider the fully decentralized multi-agent reinforcement learning (MARL) problem,
where the agents are connected via a time-varying and possibly sparse communication …

被引用次数：657 相关文章所有 8 个版本

高级搜索

QQ 群

Deep reinforcement learning: A brief survey

A brief survey of deep reinforcement learning

Mastering diverse domains through world models

Mastering atari with discrete world models

Soft actor-critic algorithms and applications

An introduction to deep reinforcement learning

First return, then explore

Impala: Scalable distributed deep-rl with importance weighted actor-learner architectures

Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor

Fully decentralized multi-agent reinforcement learning with networked agents

引用