Deep reinforcement learning: A brief survey

K Arulkumaran, MP Deisenroth… - IEEE Signal …, 2017 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) is poised to revolutionize the field of artificial intelligence
(AI) and represents a step toward building autonomous systems with a higher-level …

A brief survey of deep reinforcement learning

K Arulkumaran, MP Deisenroth, M Brundage… - arXiv preprint arXiv …, 2017 - arxiv.org
Deep reinforcement learning is poised to revolutionise the field of AI and represents a step
towards building autonomous systems with a higher level understanding of the visual world …

Q-transformer: Scalable offline reinforcement learning via autoregressive q-functions

Y Chebotar, Q Vuong, K Hausman… - … on Robot Learning, 2023 - proceedings.mlr.press
In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …

Grandmaster level in StarCraft II using multi-agent reinforcement learning

O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu… - nature, 2019 - nature.com
Many real-world applications require artificial agents to compete and coordinate with other
agents in complex environments. As a stepping stone to this goal, the domain of StarCraft …

Hindsight experience replay

M Andrychowicz, F Wolski, A Ray… - Advances in neural …, 2017 - proceedings.neurips.cc
Dealing with sparse rewards is one of the biggest challenges in Reinforcement Learning
(RL). We present a novel technique called Hindsight Experience Replay which allows …

A generalized algorithm for multi-objective reinforcement learning and policy adaptation

R Yang, X Sun, K Narasimhan - Advances in neural …, 2019 - proceedings.neurips.cc
We introduce a new algorithm for multi-objective reinforcement learning (MORL) with linear
preferences, with the goal of enabling few-shot adaptation to new tasks. In MORL, the aim is …

Deep reinforcement learning based volt-var optimization in smart distribution systems

Y Zhang, X Wang, J Wang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This paper develops a model-free volt-VAR optimization (VVO) algorithm via multi-agent
deep reinforcement learning (DRL) in unbalanced distribution systems. This method is novel …

Language as an abstraction for hierarchical deep reinforcement learning

Y Jiang, SS Gu, KP Murphy… - Advances in Neural …, 2019 - proceedings.neurips.cc
Solving complex, temporally-extended tasks is a long-standing problem in reinforcement
learning (RL). We hypothesize that one critical element of solving such problems is the …

Action branching architectures for deep reinforcement learning

A Tavakoli, F Pardo, P Kormushev - … of the aaai conference on artificial …, 2018 - ojs.aaai.org
Discrete-action algorithms have been central to numerous recent successes of deep
reinforcement learning. However, applying these algorithms to high-dimensional action …

Temporal difference models: Model-free deep rl for model-based control

V Pong, S Gu, M Dalal, S Levine - arXiv preprint arXiv:1802.09081, 2018 - arxiv.org
Model-free reinforcement learning (RL) is a powerful, general tool for learning complex
behaviors. However, its sample efficiency is often impractically large for solving challenging …