Exploiting action-value uncertainty to drive exploration in reinforcement learning

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier

This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

被引用次数：203 相关文章所有 5 个版本

[HTML] sciencedirect.com

[HTML][HTML] A review of uncertainty quantification in deep learning: Techniques, applications and challenges

M Abdar, F Pourpanah, S Hussain, D Rezazadegan… - Information fusion, 2021 - Elsevier

Uncertainty quantification (UQ) methods play a pivotal role in reducing the impact of
uncertainties during both optimization and decision making processes. They have been …

被引用次数：1889 相关文章所有 12 个版本

[PDF] neurips.cc

Interesting object, curious agent: Learning task-agnostic exploration

S Parisi, V Dean, D Pathak… - Advances in Neural …, 2021 - proceedings.neurips.cc

Common approaches for task-agnostic exploration learn tabula-rasa--the agent assumes
isolated environments and no prior knowledge or experience. However, in the real world …

被引用次数：46 相关文章所有 8 个版本

[PDF] arxiv.org

Self-supervised learning for joint pushing and grasping policies in highly cluttered environments

Y Wang, K Mokhtar, C Heemskerk, H Kasaei - arXiv preprint arXiv …, 2022 - arxiv.org

Robots often face situations where grasping a goal object is desirable but not feasible due to
other present objects preventing the grasp action. We present a deep Reinforcement …

被引用次数：4 相关文章所有 4 个版本

A Novel Heuristic Exploration Method Based on Action Effectiveness Constraints to Relieve Loop Enhancement Effect in Reinforcement Learning with Sparse …

Z Ni, Y Jin, P Liu, W Zhao - Cognitive Computation, 2024 - Springer

In realistic sparse reward tasks, existing theoretical methods cannot be effectively applied
due to the low sampling probability ofrewarded episodes. Profound research on methods …

[PDF] mdpi.com

Long-term visitation value for deep exploration in sparse-reward reinforcement learning

S Parisi, D Tateo, M Hensel, C D'eramo, J Peters… - Algorithms, 2022 - mdpi.com

Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely
on getting feedback via extrinsic rewards to train the agent, and in situations where this …

被引用次数：9 相关文章所有 14 个版本

[PDF] arxiv.org

Beyond Optimism: Exploration With Partially Observable Rewards

S Parisi, A Kazemipour, M Bowling - arXiv preprint arXiv:2406.13909, 2024 - arxiv.org

Exploration in reinforcement learning (RL) remains an open challenge. RL algorithms rely
on observing rewards to train the agent, and if informative rewards are sparse the agent …

[HTML] diva-portal.org

[HTML][HTML] Efficient Exploration and Robustness in Controlled Dynamical Systems

A Russo - 2023 - diva-portal.org

In this thesis, we explore two distinct topics. The first part of the thesis delves into efficient
exploration in multi-task bandit models and model-free exploration in large Markov decision …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-Armed Bandits for Minesweeper: Profiting From Exploration–Exploitation Synergy

IQ Lordeiro, DB Haddad… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

A popular computer puzzle, the game of Minesweeper requires its human players to have a
mix of both luck and strategy to succeed. Analyzing these aspects more formally, in our …

被引用次数：4 相关文章所有 5 个版本

[PDF] cmu.edu

[PDF][PDF] From Reinforcement Learning to Robot Learning: Leveraging Prior Data and Shared Evaluation

V Dean - 2023 - kilthub.cmu.edu

Deep learning holds promise for learning complex patterns from data, which is especially
useful when the input or output space is large. In robot learning, both the input (images or …

高级搜索

QQ 群