Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

SDRL: interpretable and data-efficient deep reinforcement learning leveraging symbolic planning

D Lyu, F Yang, B Liu, S Gustafson - … of the AAAI Conference on Artificial …, 2019 - ojs.aaai.org
Deep reinforcement learning (DRL) has gained great success by learning directly from high-
dimensional sensory inputs, yet is notorious for the lack of interpretability. Interpretability of …

Reasoning about physical interactions with object-oriented prediction and planning

M Janner, S Levine, WT Freeman… - arXiv preprint arXiv …, 2018 - arxiv.org
Object-based factorizations provide a useful level of abstraction for interacting with the
world. Building explicit object representations, however, often requires supervisory signals …

Human-level reinforcement learning through theory-based modeling, exploration, and planning

PA Tsividis, J Loula, J Burga, N Foss… - arXiv preprint arXiv …, 2021 - arxiv.org
Reinforcement learning (RL) studies how an agent comes to achieve reward in an
environment through interactions over time. Recent advances in machine RL have …

Unsupervised object interaction learning with counterfactual dynamics models

J Choi, S Lee, X Wang, S Sohn, H Lee - Proceedings of the AAAI …, 2024 - ojs.aaai.org
We present COIL (Counterfactual Object Interaction Learning), a novel way of learning skills
of object interactions on entity-centric environments. The goal is to learn primitive behaviors …

Roll: Visual self-supervised reinforcement learning with object reasoning

Y Wang, N Gautham, X Lin… - Conference on Robot …, 2021 - proceedings.mlr.press
Current image-based reinforcement learning (RL) algorithms typically operate on the whole
image without performing object-level reasoning. This leads to inefficient goal sampling and …

Causal world models by unsupervised deconfounding of physical dynamics

M Li, M Yang, F Liu, X Chen, Z Chen… - arXiv preprint arXiv …, 2020 - arxiv.org
The capability of imagining internally with a mental model of the world is vitally important for
human cognition. If a machine intelligent agent can learn a world model to create a" dream" …

Learning abstract models for strategic exploration and fast reward transfer

EZ Liu, R Keramati, S Seshadri, K Guu… - arXiv preprint arXiv …, 2020 - arxiv.org
Model-based reinforcement learning (RL) is appealing because (i) it enables planning and
thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables …

Одновременное планирование и обучение в иерархической системе управления когнитивным агентом

АИ Панов - Автоматика и телемеханика, 2022 - mathnet.ru
Задачи планирования поведения и обучения принятию решений в динамической среде
в системах управления интеллектуальными агентами обычно разделяют и …

Learning and planning with logical automata

B Araki, K Vodrahalli, T Leech, CI Vasile… - Autonomous …, 2021 - Springer
We introduce a method to learn policies from expert demonstrations that are interpretable
and manipulable. We achieve interpretability by modeling the interactions between high …