World models and predictive coding for cognitive and developmental robotics: Frontiers and challenges

T Taniguchi, S Murata, M Suzuki, D Ognibene… - Advanced …, 2023 - Taylor & Francis
Creating autonomous robots that can actively explore the environment, acquire knowledge
and learn skills continuously is the ultimate achievement envisioned in cognitive and …

Optimal goal-reaching reinforcement learning via quasimetric learning

T Wang, A Torralba, P Isola… - … Conference on Machine …, 2023 - proceedings.mlr.press
In goal-reaching reinforcement learning (RL), the optimal value function has a particular
geometry, called quasimetrics structure. This paper introduces Quasimetric Reinforcement …

Repo: Resilient model-based reinforcement learning by regularizing posterior predictability

C Zhu, M Simchowitz, S Gadipudi… - Advances in Neural …, 2024 - proceedings.neurips.cc
Visual model-based RL methods typically encode image observations into low-dimensional
representations in a manner that does not eliminate redundant information. This leaves them …

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu
Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

HarmonyDream: Task Harmonization Inside World Models

H Ma, J Wu, N Feng, C Xiao, D Li… - … on Machine Learning, 2024 - openreview.net
Model-based reinforcement learning (MBRL) holds the promise of sample-efficient learning
by utilizing a world model, which models how the environment works and typically …

Building minimal and reusable causal state abstractions for reinforcement learning

Z Wang, C Wang, X Xiao, Y Zhu, P Stone - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Two desiderata of reinforcement learning (RL) algorithms are the ability to learn from
relatively little experience and the ability to learn policies that generalize to a range of …

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc
Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

Latent state marginalization as a low-cost approach for improving exploration

D Zhang, A Courville, Y Bengio, Q Zheng… - arXiv preprint arXiv …, 2022 - arxiv.org
While the maximum entropy (MaxEnt) reinforcement learning (RL) framework--often touted
for its exploration and robustness capabilities--is usually motivated from a probabilistic …

Learning world models with identifiable factorization

Y Liu, B Huang, Z Zhu, H Tian… - Advances in Neural …, 2023 - proceedings.neurips.cc
Extracting a stable and compact representation of the environment is crucial for efficient
reinforcement learning in high-dimensional, noisy, and non-stationary environments …

Bridging State and History Representations: Understanding Self-Predictive RL

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …