Explore, discover and learn: Unsupervised discovery of state-covering skills

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：254 相关文章所有 9 个版本

A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning

EF Morales, R Murrieta-Cid, I Becerra… - Intelligent Service …, 2021 - Springer

This article is about deep learning (DL) and deep reinforcement learning (DRL) works
applied to robotics. Both tools have been shown to be successful in delivering data-driven …

被引用次数：53 相关文章所有 4 个版本

[PDF] neurips.cc

Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc

We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

被引用次数：1306 相关文章所有 11 个版本

[PDF] neurips.cc

Behavior from the void: Unsupervised active pre-training

H Liu, P Abbeel - Advances in Neural Information …, 2021 - proceedings.neurips.cc

We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …

被引用次数：179 相关文章所有 7 个版本

[PDF] neurips.cc

Celebrating diversity in shared multi-agent reinforcement learning

C Li, T Wang, C Wu, Q Zhao… - Advances in Neural …, 2021 - proceedings.neurips.cc

Recently, deep multi-agent reinforcement learning (MARL) has shown the promise to solve
complex cooperative tasks. Its success is partly because of parameter sharing among …

被引用次数：128 相关文章所有 9 个版本

[PDF] neurips.cc

Discovering and achieving goals via world models

R Mendonca, O Rybkin, K Daniilidis… - Advances in …, 2021 - proceedings.neurips.cc

How can artificial agents learn to solve many diverse tasks in complex visual environments
without any supervision? We decompose this question into two challenges: discovering new …

被引用次数：110 相关文章所有 13 个版本

[PDF] jair.org Full View

Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey

C Colas, T Karch, O Sigaud, PY Oudeyer - Journal of Artificial Intelligence …, 2022 - jair.org

Building autonomous machines that can explore open-ended environments, discover
possible interactions and build repertoires of skills is a general objective of artificial …

被引用次数：109 相关文章所有 25 个版本

[PDF] arxiv.org

Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

被引用次数：66 相关文章所有 7 个版本

[PDF] jmlr.org

Convex reinforcement learning in finite trials

M Mutti, R De Santi, P De Bartolomeis… - Journal of Machine …, 2023 - jmlr.org

Convex Reinforcement Learning (RL) is a recently introduced framework that generalizes
the standard RL objective to any convex (or concave) function of the state distribution …

被引用次数：8 相关文章所有 5 个版本

[PDF] arxiv.org

Metra: Scalable unsupervised rl with metric-aware abstraction

S Park, O Rybkin, S Levine - arXiv preprint arXiv:2310.08887, 2023 - arxiv.org

Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

被引用次数：12 相关文章所有 4 个版本

高级搜索

QQ 群