Identifiability and generalizability in constrained inverse reinforcement learning

A Schlaginhaufen… - … Conference on Machine …, 2023 - proceedings.mlr.press
Two main challenges in Reinforcement Learning (RL) are designing appropriate reward
functions and ensuring the safety of the learned policy. To address these challenges, we …

Regularized inverse reinforcement learning

W Jeon, CY Su, P Barde, T Doan… - arXiv preprint arXiv …, 2020 - arxiv.org
Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert
behavior by acquiring reward functions that explain the expert's decisions. Regularized IRL …

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum likelihood constraint inference for inverse reinforcement learning

DRR Scobee, SS Sastry - arXiv preprint arXiv:1909.05477, 2019 - arxiv.org
While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on
estimating a reward function that best explains an expert agent's policy or demonstrated …

Identifiability in inverse reinforcement learning

H Cao, S Cohen, L Szpruch - Advances in Neural …, 2021 - proceedings.neurips.cc
Inverse reinforcement learning attempts to reconstruct the reward function in a Markov
decision problem, using observations of agent actions. As already observed in Russell …

LS-IQ: Implicit reward regularization for inverse reinforcement learning

F Al-Hafez, D Tateo, O Arenz, G Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent methods for imitation learning directly learn a $ Q $-function using an implicit reward
formulation rather than an explicit reward function. However, these methods generally …

Towards theoretical understanding of inverse reinforcement learning

AM Metelli, F Lazzati, M Restelli - … Conference on Machine …, 2023 - proceedings.mlr.press
Inverse reinforcement learning (IRL) denotes a powerful family of algorithms for recovering a
reward function justifying the behavior demonstrated by an expert agent. A well-known …

Towards resolving unidentifiability in inverse reinforcement learning

K Amin, S Singh - arXiv preprint arXiv:1601.06569, 2016 - arxiv.org
We consider a setting for Inverse Reinforcement Learning (IRL) where the learner is
extended with the ability to actively select multiple environments, observing an agent's …

BC-IRL: Learning generalizable reward functions from demonstrations

A Szot, A Zhang, D Batra, Z Kira, F Meier - arXiv preprint arXiv:2303.16194, 2023 - arxiv.org
How well do reward functions learned with inverse reinforcement learning (IRL) generalize?
We illustrate that state-of-the-art IRL algorithms, which maximize a maximum-entropy …

Misspecification in inverse reinforcement learning

J Skalse, A Abate - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Abstract The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from
a policy pi. To do this, we need a model of how pi relates to R. In the current literature, the …