Maximum likelihood constraint inference for inverse reinforcement learning

DRR Scobee, SS Sastry - arXiv preprint arXiv:1909.05477, 2019 - arxiv.org
While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on
estimating a reward function that best explains an expert agent's policy or demonstrated …

Towards resolving unidentifiability in inverse reinforcement learning

K Amin, S Singh - arXiv preprint arXiv:1601.06569, 2016 - arxiv.org
We consider a setting for Inverse Reinforcement Learning (IRL) where the learner is
extended with the ability to actively select multiple environments, observing an agent's …

Inverse reinforcement learning with simultaneous estimation of rewards and dynamics

M Herman, T Gindele, J Wagner… - Artificial intelligence …, 2016 - proceedings.mlr.press
Abstract Inverse Reinforcement Learning (IRL) describes the problem of learning an
unknown reward function of a Markov Decision Process (MDP) from observed behavior of …

Identifiability and generalizability in constrained inverse reinforcement learning

A Schlaginhaufen… - … Conference on Machine …, 2023 - proceedings.mlr.press
Two main challenges in Reinforcement Learning (RL) are designing appropriate reward
functions and ensuring the safety of the learned policy. To address these challenges, we …

Learning soft constraints from constrained expert demonstrations

A Gaurav, K Rezaee, G Liu, P Poupart - arXiv preprint arXiv:2206.01311, 2022 - arxiv.org
Inverse reinforcement learning (IRL) methods assume that the expert data is generated by
an agent optimizing some reward function. However, in many settings, the agent may …

Towards theoretical understanding of inverse reinforcement learning

AM Metelli, F Lazzati, M Restelli - … Conference on Machine …, 2023 - proceedings.mlr.press
Inverse reinforcement learning (IRL) denotes a powerful family of algorithms for recovering a
reward function justifying the behavior demonstrated by an expert agent. A well-known …

Misspecification in inverse reinforcement learning

J Skalse, A Abate - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Abstract The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from
a policy pi. To do this, we need a model of how pi relates to R. In the current literature, the …

A survey of inverse reinforcement learning: Challenges, methods and progress

S Arora, P Doshi - Artificial Intelligence, 2021 - Elsevier
Inverse reinforcement learning (IRL) is the problem of inferring the reward function of an
agent, given its policy or observed behavior. Analogous to RL, IRL is perceived both as a …

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

[PDF][PDF] Inverse reinforcement learning in partially observable environments

JD Choi, KE Kim - Journal of Machine Learning Research, 2011 - jmlr.org
Inverse reinforcement learning (IRL) is the problem of recovering the underlying reward
function from the behavior of an expert. Most of the existing IRL algorithms assume that the …