[HTML][HTML] Deep learning, reinforcement learning, and world models

Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver… - Neural Networks, 2022 - Elsevier
Deep learning (DL) and reinforcement learning (RL) methods seem to be a part of
indispensable factors to achieve human-level or super-human AI systems. On the other …

Identifiability and generalizability in constrained inverse reinforcement learning

A Schlaginhaufen… - … Conference on Machine …, 2023 - proceedings.mlr.press
Two main challenges in Reinforcement Learning (RL) are designing appropriate reward
functions and ensuring the safety of the learned policy. To address these challenges, we …

Distributed inverse constrained reinforcement learning for multi-agent systems

S Liu, M Zhu - Advances in Neural Information Processing …, 2022 - proceedings.neurips.cc
This paper considers the problem of recovering the policies of multiple interacting experts by
estimating their reward functions and constraints where the demonstration data of the …

Offline inverse reinforcement learning

F Jarboui, V Perchet - arXiv preprint arXiv:2106.05068, 2021 - arxiv.org
The objective of offline RL is to learn optimal policies when a fixed exploratory
demonstrations data-set is available and sampling additional observations is impossible …

Curricular Subgoals for Inverse Reinforcement Learning

S Liu, Y Qing, S Xu, H Wu, J Zhang, J Cong… - arXiv preprint arXiv …, 2023 - arxiv.org
Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert
demonstrations to facilitate policy learning, and has demonstrated its remarkable success in …

Robust imitation via mirror descent inverse reinforcement learning

DS Han, H Kim, H Lee, JH Ryu… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recently, adversarial imitation learning has shown a scalable reward acquisition method for
inverse reinforcement learning (IRL) problems. However, estimated reward signals often …

Multi Task Inverse Reinforcement Learning for Common Sense Reward

N Glazer, A Navon, A Shamsian, E Fetaya - arXiv preprint arXiv …, 2024 - arxiv.org
One of the challenges in applying reinforcement learning in a complex real-world
environment lies in providing the agent with a sufficiently detailed reward function. Any …

Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning

A Schlaginhaufen, M Kamgarpour - arXiv preprint arXiv:2406.01793, 2024 - arxiv.org
Inverse reinforcement learning (IRL) aims to infer a reward from expert demonstrations,
motivated by the idea that the reward, rather than the policy, is the most succinct and …

逆强化学习算法, 理论与应用研究综述

宋莉, 李大字, 徐昕 - 自动化学报, 2023 - aas.net.cn
随着深度强化学习的研究与发展, 强化学习在博弈与优化决策, 智能驾驶等现实问题中的应用也
取得显著进展. 然而强化学习在智能体与环境的交互中存在人工设计奖励函数难的问题 …

A generalised inverse reinforcement learning framework

F Jarboui, V Perchet - arXiv preprint arXiv:2105.11812, 2021 - arxiv.org
The gloabal objective of inverse Reinforcement Learning (IRL) is to estimate the unknown
cost function of some MDP base on observed trajectories generated by (approximate) …