Denoised mdps: Learning world models better than the world itself

World models and predictive coding for cognitive and developmental robotics: Frontiers and challenges

T Taniguchi, S Murata, M Suzuki, D Ognibene… - Advanced …, 2023 - Taylor & Francis

Creating autonomous robots that can actively explore the environment, acquire knowledge
and learn skills continuously is the ultimate achievement envisioned in cognitive and …

被引用次数：47 相关文章所有 14 个版本

[PDF] mlr.press

Optimal goal-reaching reinforcement learning via quasimetric learning

T Wang, A Torralba, P Isola… - … Conference on Machine …, 2023 - proceedings.mlr.press

In goal-reaching reinforcement learning (RL), the optimal value function has a particular
geometry, called quasimetrics structure. This paper introduces Quasimetric Reinforcement …

被引用次数：19 相关文章所有 7 个版本

[PDF] neurips.cc

Repo: Resilient model-based reinforcement learning by regularizing posterior predictability

C Zhu, M Simchowitz, S Gadipudi… - Advances in Neural …, 2024 - proceedings.neurips.cc

Visual model-based RL methods typically encode image observations into low-dimensional
representations in a manner that does not eliminate redundant information. This leaves them …

被引用次数：4 相关文章所有 5 个版本

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arXiv preprint arXiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

被引用次数：10 相关文章所有 2 个版本

[PDF] openreview.net

HarmonyDream: Task Harmonization Inside World Models

H Ma, J Wu, N Feng, C Xiao, D Li… - … on Machine Learning, 2024 - openreview.net

Model-based reinforcement learning (MBRL) holds the promise of sample-efficient learning
by utilizing a world model, which models how the environment works and typically …

被引用次数：1 相关文章

[PDF] aaai.org

Building minimal and reusable causal state abstractions for reinforcement learning

Z Wang, C Wang, X Xiao, Y Zhu, P Stone - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Two desiderata of reinforcement learning (RL) algorithms are the ability to learn from
relatively little experience and the ability to learn policies that generalize to a range of …

被引用次数：3 相关文章所有 6 个版本

[PDF] neurips.cc

Ignorance is bliss: Robust control via information gating

M Tomar, R Islam, M Taylor… - Advances in Neural …, 2023 - proceedings.neurips.cc

Informational parsimony provides a useful inductive bias for learning representations that
achieve better generalization by being robust to noise and spurious correlations. We …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Latent state marginalization as a low-cost approach for improving exploration

D Zhang, A Courville, Y Bengio, Q Zheng… - arXiv preprint arXiv …, 2022 - arxiv.org

While the maximum entropy (MaxEnt) reinforcement learning (RL) framework--often touted
for its exploration and robustness capabilities--is usually motivated from a probabilistic …

被引用次数：11 相关文章所有 5 个版本

[PDF] neurips.cc

Learning world models with identifiable factorization

Y Liu, B Huang, Z Zhu, H Tian… - Advances in Neural …, 2023 - proceedings.neurips.cc

Extracting a stable and compact representation of the environment is crucial for efficient
reinforcement learning in high-dimensional, noisy, and non-stationary environments …

被引用次数：3 相关文章所有 7 个版本

[PDF] arxiv.org

Bridging State and History Representations: Understanding Self-Predictive RL

T Ni, B Eysenbach, E Seyedsalehi, M Ma… - arXiv preprint arXiv …, 2024 - arxiv.org

Representations are at the core of all deep reinforcement learning (RL) methods for both
Markov decision processes (MDPs) and partially observable Markov decision processes …

被引用次数：5 相关文章所有 4 个版本

高级搜索

QQ 群