Conservative data sharing for multi-task offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - Advances in …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …

Usher: Unbiased sampling for hindsight experience replay

L Schramm, Y Deng, E Granados… - Conference on Robot …, 2023 - proceedings.mlr.press
Dealing with sparse rewards is a long-standing challenge in reinforcement learning (RL).
Hindsight Experience Replay (HER) addresses this problem by reusing failed trajectories for …

Robotic Test Tube Rearrangement Using Combined Reinforcement Learning and Motion Planning

H Chen, W Wan, M Matsushita, T Kotaka… - arXiv preprint arXiv …, 2024 - arxiv.org
A combined task-level reinforcement learning and motion planning framework is proposed
in this paper to address a multi-class in-rack test tube rearrangement problem. At the task …

Clustering-based Failed goal Aware Hindsight Experience Replay

T Kim, T Kang, H Jeong, D Har - PeerJ Computer Science, 2024 - peerj.com
In a multi-goal reinforcement learning environment, an agent learns a policy to perform tasks
with multiple goals from experiences gained through exploration. In environments with …

Bias Resilient Multi-Step Off-Policy Goal-Conditioned Reinforcement Learning

L Wu, K Chen - arXiv preprint arXiv:2311.17565, 2023 - arxiv.org
In goal-conditioned reinforcement learning (GCRL), sparse rewards present significant
challenges, often obstructing efficient learning. Although multi-step GCRL can boost this …

Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning

T Yu, A Kumar, Y Chebotar, C Finn, S Levine… - 2021 - openreview.net
Offline reinforcement learning (RL) bears the promise to learn effective control policies from
static datasets but is thus far unable to learn from large databases of heterogeneous …

Boosting Learning Efficiency in Goal-Conditioned Reinforcement Learning: Skill Augmentation and Multi-Step Learning

L Wu - 2024 - search.proquest.com
Goal-conditioned reinforcement learning (GCRL) has emerged as a promising avenue in
machine learning, offering the potential to master multiple tasks tailored to specific goals and …

An unbiased method to train robots traveling in special conditions

T Zhou - AIP Conference Proceedings, 2024 - pubs.aip.org
It is a challenge to make robots move from one place to another on the shortest path and
avoid obstacles at the same time, especially when there are some special conditions occur …

[图书][B] Reinforcement Learning from Static Datasets: Algorithms, Analysis, and Applications

A Kumar - 2023 - search.proquest.com
Reinforcement learning (RL) provides a formalism for learning-based control. By attempting
to learn behavioral policies that can optimize a user-specified reward function, RL methods …

[图书][B] Building Versatile Reinforcement Learning Agents with Offline Data

T Yu - 2022 - search.proquest.com
Recent advances in machine learning using deep neural networks have shown significant
successes in learning from large datasets. However, these successes concentrated on …