Toward computationally efficient inverse reinforcement learning via reward shaping

Citing articles (2 results):

EvIL: Evolution Strategies for Generalisable Imitation Learning

S Sapora, G Swamy, C Lu, YW Teh… - arXiv preprint arXiv …, 2024 - arxiv.org

Often times in imitation learning (IL), the environment we collect expert demonstrations in
and the environment we want to deploy our learned policy in aren't exactly the same (eg …

Cited by: 2 · Related articles · All 3 versions

Bootstrapped Reward Shaping

J Adamczyk, V Makarenko, S Tiomkin… - arXiv preprint arXiv …, 2025 - arxiv.org

In reinforcement learning, especially in sparse-reward domains, many environment steps
are required to observe reward information. In order to increase the frequency of such …
