Strategic object oriented reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer

Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

被引用次数：430 相关文章所有 9 个版本

[PDF] aaai.org

SDRL: interpretable and data-efficient deep reinforcement learning leveraging symbolic planning

D Lyu, F Yang, B Liu, S Gustafson - … of the AAAI Conference on Artificial …, 2019 - ojs.aaai.org

Deep reinforcement learning (DRL) has gained great success by learning directly from high-
dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of …

被引用次数：194 相关文章所有 13 个版本

[PDF] arxiv.org

Reasoning about physical interactions with object-oriented prediction and planning

M Janner, S Levine, WT Freeman… - arXiv preprint arXiv …, 2018 - arxiv.org

Object-based factorizations provide a useful level of abstraction for interacting with the
world. Building explicit object representations, however, often requires supervisory signals …

被引用次数：142 相关文章所有 6 个版本

[PDF] arxiv.org

Human-level reinforcement learning through theory-based modeling, exploration, and planning

PA Tsividis, J Loula, J Burga, N Foss… - arXiv preprint arXiv …, 2021 - arxiv.org

Reinforcement learning (RL) studies how an agent comes to achieve reward in an
environment through interactions over time. Recent advances in machine RL have …

被引用次数：68 相关文章所有 2 个版本

[PDF] aaai.org

Unsupervised object interaction learning with counterfactual dynamics models

J Choi, S Lee, X Wang, S Sohn, H Lee - Proceedings of the AAAI …, 2024 - ojs.aaai.org

We present COIL (Counterfactual Object Interaction Learning), a novel way of learning skills
of object interactions on entity-centric environments. The goal is to learn primitive behaviors …

被引用次数：4 相关文章所有 2 个版本

[PDF] mlr.press

Roll: Visual self-supervised reinforcement learning with object reasoning

Y Wang, N Gautham, X Lin… - Conference on Robot …, 2021 - proceedings.mlr.press

Current image-based reinforcement learning (RL) algorithms typically operate on the whole
image without performing object-level reasoning. This leads to inefficient goal sampling and …

被引用次数：15 相关文章所有 5 个版本

[PDF] arxiv.org

Causal world models by unsupervised deconfounding of physical dynamics

M Li, M Yang, F Liu, X Chen, Z Chen… - arXiv preprint arXiv …, 2020 - arxiv.org

The capability of imagining internally with a mental model of the world is vitally important for
human cognition. If a machine intelligent agent can learn a world model to create a" dream" …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Learning abstract models for strategic exploration and fast reward transfer

EZ Liu, R Keramati, S Seshadri, K Guu… - arXiv preprint arXiv …, 2020 - arxiv.org

Model-based reinforcement learning (RL) is appealing because (i) it enables planning and
thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables …

被引用次数：7 相关文章

Одновременное планирование и обучение в иерархической системе управления когнитивным агентом

АИ Панов - Автоматика и телемеханика, 2022 - mathnet.ru

Задачи планирования поведения и обучения принятию решений в динамической среде
в системах управления интеллектуальными агентами обычно разделяют и …

被引用次数：5 相关文章所有 3 个版本

[PDF] springer.com

Learning and planning with logical automata

B Araki, K Vodrahalli, T Leech, CI Vasile… - Autonomous …, 2021 - Springer

We introduce a method to learn policies from expert demonstrations that are interpretable
and manipulable. We achieve interpretability by modeling the interactions between high …

被引用次数：2 相关文章所有 7 个版本

高级搜索

QQ 群