相关文章- 学术资源搜索

Identifiability in inverse reinforcement learning

H Cao, S Cohen, L Szpruch - Advances in Neural …, 2021 - proceedings.neurips.cc

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov
decision problem, using observations of agent actions. As already observed in Russell …

被引用次数：54 相关文章所有 7 个版本

[PDF] mlr.press

Identifiability and generalizability in constrained inverse reinforcement learning

A Schlaginhaufen… - … Conference on Machine …, 2023 - proceedings.mlr.press

Two main challenges in Reinforcement Learning (RL) are designing appropriate reward
functions and ensuring the safety of the learned policy. To address these challenges, we …

被引用次数：13 相关文章所有 9 个版本

[PDF] arxiv.org

Continuous inverse optimal control with locally optimal examples

S Levine, V Koltun - arXiv preprint arXiv:1206.4617, 2012 - arxiv.org

Inverse optimal control, also known as inverse reinforcement learning, is the problem of
recovering an unknown reward function in a Markov decision process from expert …

被引用次数：417 相关文章所有 14 个版本

[PDF] arxiv.org

Maximum likelihood constraint inference for inverse reinforcement learning

DRR Scobee, SS Sastry - arXiv preprint arXiv:1909.05477, 2019 - arxiv.org

While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on
estimating a reward function that best explains an expert agent's policy or demonstrated …

被引用次数：75 相关文章所有 6 个版本

[PDF] mlr.press

Towards theoretical understanding of inverse reinforcement learning

AM Metelli, F Lazzati, M Restelli - … Conference on Machine …, 2023 - proceedings.mlr.press

Inverse reinforcement learning (IRL) denotes a powerful family of algorithms for recovering a
reward function justifying the behavior demonstrated by an expert agent. A well-known …

被引用次数：23 相关文章所有 10 个版本

[PDF] neurips.cc

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc

Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

被引用次数：34 相关文章所有 10 个版本

[PDF] mlr.press

Provably efficient learning of transferable rewards

AM Metelli, G Ramponi, A Concetti… - … on Machine Learning, 2021 - proceedings.mlr.press

The reward function is widely accepted as a succinct, robust, and transferable
representation of a task. Typical approaches, at the basis of Inverse Reinforcement Learning …

被引用次数：37 相关文章所有 5 个版本

[PDF] aaai.org

Efficient probabilistic performance bounds for inverse reinforcement learning

D Brown, S Niekum - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org

In the field of reinforcement learning there has been recent progress towards safety and high-
confidence bounds on policy performance. However, to our knowledge, no practical …

被引用次数：55 相关文章所有 13 个版本

[PDF] mlr.press

Learning from a learner

A Jacq, M Geist, A Paiva… - … Conference on Machine …, 2019 - proceedings.mlr.press

In this paper, we propose a novel setting for Inverse Reinforcement Learning (IRL), namely"
Learning from a Learner"(LfL). As opposed to standard IRL, it does not consist in learning a …

被引用次数：41 相关文章所有 8 个版本

[PDF] neurips.cc

Deep inverse Q-learning with constraints

G Kalweit, M Huegle, M Werling… - Advances in neural …, 2020 - proceedings.neurips.cc

Abstract Popular Maximum Entropy Inverse Reinforcement Learning approaches require the
computation of expected state visitation frequencies for the optimal policy under an estimate …

被引用次数：40 相关文章所有 6 个版本

高级搜索

QQ 群

Identifiability in inverse reinforcement learning

Identifiability and generalizability in constrained inverse reinforcement learning

Continuous inverse optimal control with locally optimal examples

Maximum likelihood constraint inference for inverse reinforcement learning

Towards theoretical understanding of inverse reinforcement learning

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

Provably efficient learning of transferable rewards

Efficient probabilistic performance bounds for inverse reinforcement learning

Learning from a learner

Deep inverse Q-learning with constraints

相关搜索

引用