Identifiability in inverse reinforcement learning

H Cao, S Cohen, L Szpruch - Advances in Neural …, 2021 - proceedings.neurips.cc
Inverse reinforcement learning attempts to reconstruct the reward function in a Markov
decision problem, using observations of agent actions. As already observed in Russell …

Identifiability and generalizability in constrained inverse reinforcement learning

A Schlaginhaufen… - … Conference on Machine …, 2023 - proceedings.mlr.press
Two main challenges in Reinforcement Learning (RL) are designing appropriate reward
functions and ensuring the safety of the learned policy. To address these challenges, we …

Continuous inverse optimal control with locally optimal examples

S Levine, V Koltun - arXiv preprint arXiv:1206.4617, 2012 - arxiv.org
Inverse optimal control, also known as inverse reinforcement learning, is the problem of
recovering an unknown reward function in a Markov decision process from expert …

Maximum likelihood constraint inference for inverse reinforcement learning

DRR Scobee, SS Sastry - arXiv preprint arXiv:1909.05477, 2019 - arxiv.org
While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on
estimating a reward function that best explains an expert agent's policy or demonstrated …

Towards theoretical understanding of inverse reinforcement learning

AM Metelli, F Lazzati, M Restelli - … Conference on Machine …, 2023 - proceedings.mlr.press
Inverse reinforcement learning (IRL) denotes a powerful family of algorithms for recovering a
reward function justifying the behavior demonstrated by an expert agent. A well-known …

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Provably efficient learning of transferable rewards

AM Metelli, G Ramponi, A Concetti… - … on Machine Learning, 2021 - proceedings.mlr.press
The reward function is widely accepted as a succinct, robust, and transferable
representation of a task. Typical approaches, at the basis of Inverse Reinforcement Learning …

Efficient probabilistic performance bounds for inverse reinforcement learning

D Brown, S Niekum - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org
In the field of reinforcement learning there has been recent progress towards safety and high-
confidence bounds on policy performance. However, to our knowledge, no practical …

Learning from a learner

A Jacq, M Geist, A Paiva… - … Conference on Machine …, 2019 - proceedings.mlr.press
In this paper, we propose a novel setting for Inverse Reinforcement Learning (IRL), namely"
Learning from a Learner"(LfL). As opposed to standard IRL, it does not consist in learning a …

Deep inverse Q-learning with constraints

G Kalweit, M Huegle, M Werling… - Advances in neural …, 2020 - proceedings.neurips.cc
Abstract Popular Maximum Entropy Inverse Reinforcement Learning approaches require the
computation of expected state visitation frequencies for the optimal policy under an estimate …