We consider the problem of imitation learning where the examples, demonstrated by an expert, cover only a small part of a large state space. Inverse Reinforcement Learning (IRL) …
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an …
Z Huang, J Wu, C Lv - IEEE transactions on intelligent …, 2021 - ieeexplore.ieee.org
Driving behavior modeling is of great importance for designing safe, smart, and personalized autonomous driving systems. In this paper, an internal reward function-based …
Autonomous vehicles promise to improve traffic safety while, at the same time, increase fuel efficiency and reduce congestion. They represent the main trend in future intelligent …
M Naumann, L Sun, W Zhan… - 2020 IEEE international …, 2020 - ieeexplore.ieee.org
Autonomous vehicles are sharing the road with human drivers. In order to facilitate interactive driving and cooperative behavior in dense traffic, a thorough understanding and …
Z Zhao, Z Wang, K Han, R Gupta… - … on Robotics and …, 2022 - ieeexplore.ieee.org
Driving automation is gradually replacing human driving maneuvers in different applications such as adaptive cruise control and lane keeping. However, contemporary driving …
P Wang, D Liu, J Chen, H Li… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Making decisions in complex driving environments is a challenging task for autonomous agents. Imitation learning methods have great potentials for achieving such a goal …
D Lindner, A Krause… - Advances in Neural …, 2022 - proceedings.neurips.cc
Abstract Inverse Reinforcement Learning (IRL) is a powerful paradigm for inferring a reward function from expert demonstrations. Many IRL algorithms require a known transition model …
A Majumdar, S Singh… - … science and systems, 2017 - m.roboticsproceedings.org
The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, ie, that humans are risk …