A bayesian approach to generative adversarial imitation learning

M Zare, PM Kebria, A Khosravi… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In recent years, the development of robotics and artificial intelligence (AI) systems has been
nothing short of remarkable. As these systems continue to evolve, they are being utilized in …

被引用次数：62 相关文章所有 2 个版本

[PDF] neurips.cc

Why generalization in rl is difficult: Epistemic pomdps and implicit partial observability

D Ghosh, J Rahme, A Kumar, A Zhang… - Advances in neural …, 2021 - proceedings.neurips.cc

Generalization is a central challenge for the deployment of reinforcement learning (RL)
systems in the real world. In this paper, we show that the sequential structure of the RL …

被引用次数：122 相关文章所有 10 个版本

[PDF] mlr.press

Offline rl policies should be trained to be adaptive

D Ghosh, A Ajay, P Agrawal… - … Conference on Machine …, 2022 - proceedings.mlr.press

Offline RL algorithms must account for the fact that the dataset they are provided may leave
many facets of the environment unknown. The most common way to approach this challenge …

被引用次数：48 相关文章所有 5 个版本

[PDF] nature.com

Hybrid hierarchical learning for solving complex sequential tasks using the robotic manipulation network ROMAN

E Triantafyllidis, F Acero, Z Liu, Z Li - Nature Machine Intelligence, 2023 - nature.com

Solving long sequential tasks remains a non-trivial challenge in the field of embodied
artificial intelligence. Enabling a robotic system to perform diverse sequential tasks with a …

被引用次数：17 相关文章所有 7 个版本

[PDF] arxiv.org

Deep generative models for offline policy learning: Tutorial, survey, and perspectives on future directions

J Chen, B Ganguly, Y Xu, Y Mei, T Lan… - arXiv preprint arXiv …, 2024 - arxiv.org

Deep generative models (DGMs) have demonstrated great success across various domains,
particularly in generating texts, images, and videos using models trained from offline data …

被引用次数：10 相关文章所有 3 个版本

[PDF] mlr.press

Variational inference mpc for bayesian model-based reinforcement learning

M Okada, T Taniguchi - Conference on robot learning, 2020 - proceedings.mlr.press

In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty
in forward dynamics is a state-of-the-art strategy to enhance learning performance, making …

被引用次数：79 相关文章所有 4 个版本

[PDF] mlr.press

Inverse decision modeling: Learning interpretable representations of behavior

D Jarrett, A Hüyük… - … Conference on Machine …, 2021 - proceedings.mlr.press

Decision analysis deals with modeling and enhancing decision processes. A principal
challenge in improving behavior is in obtaining a transparent* description* of existing …

被引用次数：32 相关文章所有 7 个版本

[PDF] neurips.cc

Strictly batch imitation learning by energy-based distribution matching

D Jarrett, I Bica… - Advances in Neural …, 2020 - proceedings.neurips.cc

Consider learning a policy purely on the basis of demonstrated behavior---that is, with no
access to reinforcement signals, no knowledge of transition dynamics, and no further …

被引用次数：65 相关文章所有 9 个版本

[PDF] mlr.press

Inverse constrained reinforcement learning

S Malik, U Anwar, A Aghasi… - … conference on machine …, 2021 - proceedings.mlr.press

In real world settings, numerous constraints are present which are hard to specify
mathematically. However, for the real world deployment of reinforcement learning (RL), it is …

被引用次数：59 相关文章所有 3 个版本

[PDF] thecvf.com

Learning by watching

J Zhang, E Ohn-Bar - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com

When in a new situation or geographical location, human drivers have an extraordinary
ability to watch others and learn maneuvers that they themselves may have never …

被引用次数：43 相关文章所有 6 个版本

高级搜索

QQ 群