- 学术资源搜索

Fitvid: Overfitting in pixel-level video prediction

M Babaeizadeh, MT Saffar, S Nair, S Levine… - arXiv preprint arXiv …, 2021 - arxiv.org

An agent that is capable of predicting what happens next can perform a variety of tasks
through planning with no additional training. Furthermore, such an agent can internally …

被引用次数：75 相关文章所有 3 个版本

[PDF] thecvf.com

Pathdreamer: A world model for indoor navigation

JY Koh, H Lee, Y Yang, J Baldridge… - Proceedings of the …, 2021 - openaccess.thecvf.com

People navigating in unfamiliar buildings take advantage of myriad visual, spatial and
semantic cues to efficiently achieve their navigation goals. Towards equipping …

被引用次数：66 相关文章所有 10 个版本

[PDF] arxiv.org

Visual spatial attention and proprioceptive data-driven reinforcement learning for robust peg-in-hole task under variable conditions

AY Yasutomi, H Ichiwara, H Ito, H Mori… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

Anchor-bolt insertion is a peg-in-hole task performed in the construction field for holes in
concrete. Efforts have been made to automate this task, but the variable lighting and hole …

被引用次数：13 相关文章所有 7 个版本

[PDF] arxiv.org

Dreamsmooth: Improving model-based reinforcement learning via reward smoothing

V Lee, P Abbeel, Y Lee - arXiv preprint arXiv:2311.01450, 2023 - arxiv.org

Model-based reinforcement learning (MBRL) has gained much attention for its ability to
learn complex behaviors in a sample-efficient way: planning actions by generating …

被引用次数：2 相关文章所有 3 个版本

[PDF] github.io

Replay overshooting: Learning stochastic latent dynamics with the extended kalman filter

AH Li, P Wu, M Kennedy - 2021 IEEE international conference …, 2021 - ieeexplore.ieee.org

This paper presents replay overshooting (RO), an algorithm that uses properties of the
extended Kalman filter (EKF) to learn nonlinear stochastic latent dynamics models suitable …

被引用次数：8 相关文章所有 5 个版本

[PDF] ejournal.org.cn

[PDF][PDF] 基于无标签视频数据的深度预测学习方法综述

潘敏婷，王韫博，朱祥明，高思宇，龙明盛，杨小康 - 电子学报, 2022 - ejournal.org.cn

基于视频数据的深度预测学习(以下简称“深度预测学习”) 属于深度学习, 计算机视觉和强化学习
的交叉融合研究方向, 是气象预报, 自动驾驶, 机器人视觉控制等场景下智能预测与决策系统的 …

被引用次数：1 相关文章所有 3 个版本

[PDF] openreview.net

Blast: Latent dynamics models from bootstrapping

K Paster, LE McKinney, SA McIlraith… - Deep RL Workshop …, 2021 - openreview.net

State-of-the-art world models such as DreamerV2 have significantly improved the
capabilities of model-based reinforcement learning. However, these approaches typically …

被引用次数：5 相关文章

[PDF] escholarship.org

[图书][B] Synergy of Prediction and Control in Model-based Reinforcement Learning

NO Lambert - 2022 - search.proquest.com

Abstract Model-based reinforcement learning (MBRL) has often been touted for its potential
to improve on the sample-efficiency, generalization, and safety of existing reinforcement …

被引用次数：1 相关文章所有 3 个版本

Model-Based Reinforcement Learning

A Plaat - Deep Reinforcement Learning, 2022 - Springer

The previous chapters discussed model-free methods, and we saw their success in video
games and simulated robotics. In model-free methods, the agent updates a policy directly …

被引用次数：1 相关文章

[PDF] nii.ac.jp

[PDF][PDF] Deep Reinforcement Learning Adapted to Real-World Training Data Limitations

AY Yasutomi - 2023 - waseda.repo.nii.ac.jp

Abstract Deep Reinforcement Learning (DRL) is a rapidly growing area of research in the
field of artificial intelligence, which has shown exceptional success in solving complex tasks …

高级搜索

QQ 群