- 学术资源搜索

Conservative data sharing for multi-task offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - Advances in …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …

被引用次数：85 相关文章所有 6 个版本

[PDF] mlr.press

Usher: Unbiased sampling for hindsight experience replay

L Schramm, Y Deng, E Granados… - Conference on Robot …, 2023 - proceedings.mlr.press

Dealing with sparse rewards is a long-standing challenge in reinforcement learning (RL).
Hindsight Experience Replay (HER) addresses this problem by reusing failed trajectories for …

被引用次数：9 相关文章所有 9 个版本

[PDF] arxiv.org

Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning

H Chen, W Wan, M Matsushita, T Kotaka… - arXiv preprint arXiv …, 2024 - arxiv.org

A combined task-level reinforcement learning and motion planning framework is proposed
in this paper to address a multi-class in-rack test tube rearrangement problem. At the task …

被引用次数：1 相关文章所有 2 个版本

[PDF] peerj.com

Clustering-based Failed goal Aware Hindsight Experience Replay

T Kim, T Kang, H Jeong, D Har - PeerJ Computer Science, 2024 - peerj.com

In a multi-goal reinforcement learning environment, an agent learns a policy to perform tasks
with multiple goals from experiences gained through exploration. In environments with …

Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning

L Wu, K Chen - arXiv preprint arXiv:2311.17565, 2023 - arxiv.org

In goal-conditioned reinforcement learning (GCRL), sparse rewards present significant
challenges, often obstructing efficient learning. Although multi-step GCRL can boost this …

被引用次数：2 相关文章所有 5 个版本

[PDF] openreview.net

Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning

T Yu, A Kumar, Y Chebotar, C Finn, S Levine… - 2021 - openreview.net

Offline reinforcement learning (RL) bears the promise to learn effective control policies from
static datasets but is thus far unable to learn from large databases of heterogeneous …

被引用次数：2 相关文章所有 2 个版本

[PDF] manchester.ac.uk

[图书][B] Building Versatile Reinforcement Learning Agents with Offline Data

T Yu - 2022 - search.proquest.com

Recent advances in machine learning using deep neural networks have shown significant
successes in learning from large datasets. However, these successes concentrated on …

高级搜索

QQ 群