Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural …, 2022 - proceedings.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - arXiv preprint arXiv:2210.01808, 2022 - arxiv.org
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

[PDF][PDF] Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - papers.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - arXiv e-prints, 2022 - ui.adsabs.harvard.edu
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - Advances in Neural Information …, 2022 - openreview.net
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, A Garcia, C Li, M Hong - 36th Conference on Neural …, 2022 - experts.umn.edu
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

[PDF][PDF] Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - proceedings.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum-likelihood inverse reinforcement learning with finite-time guarantees

S Zeng, C Li, A Garcia, M Hong - … of the 36th International Conference on …, 2022 - dl.acm.org
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - Decision Awareness in Reinforcement … - openreview.net
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …

[PDF][PDF] Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

S Zeng, C Li, A Garcia, M Hong - papers.neurips.cc
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated
optimal policy that best fits observed sequences of states and actions implemented by an …