相关文章- 学术资源搜索

Identifiability and generalizability in constrained inverse reinforcement learning

A Schlaginhaufen… - … Conference on Machine …, 2023 - proceedings.mlr.press

Two main challenges in Reinforcement Learning (RL) are designing appropriate reward
functions and ensuring the safety of the learned policy. To address these challenges, we …

被引用次数：12 相关文章所有 9 个版本

[PDF] arxiv.org

Regularized inverse reinforcement learning

W Jeon, CY Su, P Barde, T Doan… - arXiv preprint arXiv …, 2020 - arxiv.org

Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert
behavior by acquiring reward functions that explain the expert's decisions. Regularized IRL …

被引用次数：14 相关文章所有 9 个版本

[PDF] neurips.cc

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc

Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

被引用次数：34 相关文章所有 10 个版本

[PDF] arxiv.org

Maximum likelihood constraint inference for inverse reinforcement learning

DRR Scobee, SS Sastry - arXiv preprint arXiv:1909.05477, 2019 - arxiv.org

While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on
estimating a reward function that best explains an expert agent's policy or demonstrated …

被引用次数：75 相关文章所有 6 个版本

[PDF] neurips.cc

Identifiability in inverse reinforcement learning

H Cao, S Cohen, L Szpruch - Advances in Neural …, 2021 - proceedings.neurips.cc

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov
decision problem, using observations of agent actions. As already observed in Russell …

被引用次数：54 相关文章所有 7 个版本

[PDF] arxiv.org

LS-IQ: Implicit reward regularization for inverse reinforcement learning

F Al-Hafez, D Tateo, O Arenz, G Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent methods for imitation learning directly learn a $ Q $-function using an implicit reward
formulation rather than an explicit reward function. However, these methods generally …

被引用次数：22 相关文章所有 4 个版本

[PDF] mlr.press

Towards theoretical understanding of inverse reinforcement learning

AM Metelli, F Lazzati, M Restelli - … Conference on Machine …, 2023 - proceedings.mlr.press

Inverse reinforcement learning (IRL) denotes a powerful family of algorithms for recovering a
reward function justifying the behavior demonstrated by an expert agent. A well-known …

被引用次数：23 相关文章所有 10 个版本

[PDF] arxiv.org

Towards resolving unidentifiability in inverse reinforcement learning

K Amin, S Singh - arXiv preprint arXiv:1601.06569, 2016 - arxiv.org

We consider a setting for Inverse Reinforcement Learning (IRL) where the learner is
extended with the ability to actively select multiple environments, observing an agent's …

被引用次数：30 相关文章所有 2 个版本

[PDF] arxiv.org

BC-IRL: Learning generalizable reward functions from demonstrations

A Szot, A Zhang, D Batra, Z Kira, F Meier - arXiv preprint arXiv:2303.16194, 2023 - arxiv.org

How well do reward functions learned with inverse reinforcement learning (IRL) generalize?
We illustrate that state-of-the-art IRL algorithms, which maximize a maximum-entropy …

被引用次数：8 相关文章所有 3 个版本

[PDF] aaai.org

Misspecification in inverse reinforcement learning

J Skalse, A Abate - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

Abstract The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from
a policy pi. To do this, we need a model of how pi relates to R. In the current literature, the …

被引用次数：31 相关文章所有 5 个版本

高级搜索

QQ 群

Identifiability and generalizability in constrained inverse reinforcement learning

Regularized inverse reinforcement learning

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

Maximum likelihood constraint inference for inverse reinforcement learning

Identifiability in inverse reinforcement learning

LS-IQ: Implicit reward regularization for inverse reinforcement learning

Towards theoretical understanding of inverse reinforcement learning

Towards resolving unidentifiability in inverse reinforcement learning

BC-IRL: Learning generalizable reward functions from demonstrations

Misspecification in inverse reinforcement learning

相关搜索

引用