Fitvid: Overfitting in pixel-level video prediction

M Babaeizadeh, MT Saffar, S Nair, S Levine… - arXiv preprint arXiv …, 2021 - arxiv.org
An agent that is capable of predicting what happens next can perform a variety of tasks
through planning with no additional training. Furthermore, such an agent can internally …

Pathdreamer: A world model for indoor navigation

JY Koh, H Lee, Y Yang, J Baldridge… - Proceedings of the …, 2021 - openaccess.thecvf.com
People navigating in unfamiliar buildings take advantage of myriad visual, spatial and
semantic cues to efficiently achieve their navigation goals. Towards equipping …

Visual spatial attention and proprioceptive data-driven reinforcement learning for robust peg-in-hole task under variable conditions

AY Yasutomi, H Ichiwara, H Ito, H Mori… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Anchor-bolt insertion is a peg-in-hole task performed in the construction field for holes in
concrete. Efforts have been made to automate this task, but the variable lighting and hole …

Dreamsmooth: Improving model-based reinforcement learning via reward smoothing

V Lee, P Abbeel, Y Lee - arXiv preprint arXiv:2311.01450, 2023 - arxiv.org
Model-based reinforcement learning (MBRL) has gained much attention for its ability to
learn complex behaviors in a sample-efficient way: planning actions by generating …

Replay overshooting: Learning stochastic latent dynamics with the extended kalman filter

AH Li, P Wu, M Kennedy - 2021 IEEE international conference …, 2021 - ieeexplore.ieee.org
This paper presents replay overshooting (RO), an algorithm that uses properties of the
extended Kalman filter (EKF) to learn nonlinear stochastic latent dynamics models suitable …

[PDF][PDF] 基于无标签视频数据的深度预测学习方法综述

潘敏婷, 王韫博, 朱祥明, 高思宇, 龙明盛, 杨小康 - 电子学报, 2022 - ejournal.org.cn
基于视频数据的深度预测学习(以下简称“深度预测学习”) 属于深度学习, 计算机视觉和强化学习
的交叉融合研究方向, 是气象预报, 自动驾驶, 机器人视觉控制等场景下智能预测与决策系统的 …

Blast: Latent dynamics models from bootstrapping

K Paster, LE McKinney, SA McIlraith… - Deep RL Workshop …, 2021 - openreview.net
State-of-the-art world models such as DreamerV2 have significantly improved the
capabilities of model-based reinforcement learning. However, these approaches typically …

[图书][B] Synergy of Prediction and Control in Model-based Reinforcement Learning

NO Lambert - 2022 - search.proquest.com
Abstract Model-based reinforcement learning (MBRL) has often been touted for its potential
to improve on the sample-efficiency, generalization, and safety of existing reinforcement …

Model-Based Reinforcement Learning

A Plaat - Deep Reinforcement Learning, 2022 - Springer
The previous chapters discussed model-free methods, and we saw their success in video
games and simulated robotics. In model-free methods, the agent updates a policy directly …

[PDF][PDF] Deep Reinforcement Learning Adapted to Real-World Training Data Limitations

AY Yasutomi - 2023 - waseda.repo.nii.ac.jp
Abstract Deep Reinforcement Learning (DRL) is a rapidly growing area of research in the
field of artificial intelligence, which has shown exceptional success in solving complex tasks …