Exploration in deep reinforcement learning: A survey

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

[HTML][HTML] A review of uncertainty quantification in deep learning: Techniques, applications and challenges

M Abdar, F Pourpanah, S Hussain, D Rezazadegan… - Information fusion, 2021 - Elsevier
Uncertainty quantification (UQ) methods play a pivotal role in reducing the impact of
uncertainties during both optimization and decision making processes. They have been …

Interesting object, curious agent: Learning task-agnostic exploration

S Parisi, V Dean, D Pathak… - Advances in Neural …, 2021 - proceedings.neurips.cc
Common approaches for task-agnostic exploration learn tabula-rasa--the agent assumes
isolated environments and no prior knowledge or experience. However, in the real world …

Self-supervised learning for joint pushing and grasping policies in highly cluttered environments

Y Wang, K Mokhtar, C Heemskerk, H Kasaei - arXiv preprint arXiv …, 2022 - arxiv.org
Robots often face situations where grasping a goal object is desirable but not feasible due to
other present objects preventing the grasp action. We present a deep Reinforcement …

A Novel Heuristic Exploration Method Based on Action Effectiveness Constraints to Relieve Loop Enhancement Effect in Reinforcement Learning with Sparse …

Z Ni, Y Jin, P Liu, W Zhao - Cognitive Computation, 2024 - Springer
In realistic sparse reward tasks, existing theoretical methods cannot be effectively applied
due to the low sampling probability ofrewarded episodes. Profound research on methods …

Long-term visitation value for deep exploration in sparse-reward reinforcement learning

S Parisi, D Tateo, M Hensel, C D'eramo, J Peters… - Algorithms, 2022 - mdpi.com
Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely
on getting feedback via extrinsic rewards to train the agent, and in situations where this …

Beyond Optimism: Exploration With Partially Observable Rewards

S Parisi, A Kazemipour, M Bowling - arXiv preprint arXiv:2406.13909, 2024 - arxiv.org
Exploration in reinforcement learning (RL) remains an open challenge. RL algorithms rely
on observing rewards to train the agent, and if informative rewards are sparse the agent …

[HTML][HTML] Efficient Exploration and Robustness in Controlled Dynamical Systems

A Russo - 2023 - diva-portal.org
In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient
exploration in multi-task bandit models and model-free exploration in large Markov decision …

Multi-Armed Bandits for Minesweeper: Profiting From Exploration–Exploitation Synergy

IQ Lordeiro, DB Haddad… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
A popular computer puzzle, the game of Minesweeper requires its human players to have a
mix of both luck and strategy to succeed. Analyzing these aspects more formally, in our …

[PDF][PDF] From Reinforcement Learning to Robot Learning: Leveraging Prior Data and Shared Evaluation

V Dean - 2023 - kilthub.cmu.edu
Deep learning holds promise for learning complex patterns from data, which is especially
useful when the input or output space is large. In robot learning, both the input (images or …