Improvisation through physical understanding: Using novel objects as tools with visual foresight

J Ibarz, J Tan, C Finn, M Kalakrishnan… - … Journal of Robotics …, 2021 - journals.sagepub.com

Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …

被引用次数：533 相关文章所有 7 个版本

[PDF] arxiv.org

A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org

The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

被引用次数：264 相关文章所有 14 个版本

[PDF] mlr.press

Daydreamer: World models for physical robot learning

P Wu, A Escontrela, D Hafner… - … on robot learning, 2023 - proceedings.mlr.press

To solve tasks in complex environments, robots need to learn from experience. Deep
reinforcement learning is a common approach to robot learning but requires a large amount …

被引用次数：183 相关文章所有 9 个版本

[PDF] arxiv.org

Emergent tool use from multi-agent autocurricula

B Baker, I Kanitscheider, T Markov, Y Wu… - arXiv preprint arXiv …, 2019 - arxiv.org

Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …

被引用次数：809 相关文章所有 3 个版本

[PDF] arxiv.org

Robonet: Large-scale multi-robot learning

S Dasari, F Ebert, S Tian, S Nair, B Bucher… - arXiv preprint arXiv …, 2019 - arxiv.org

Robot learning has emerged as a promising tool for taming the complexity and diversity of
the real world. Methods based on high-capacity models, such as deep networks, hold the …

被引用次数：270 相关文章所有 6 个版本

[PDF] arxiv.org

Mt-opt: Continuous multi-task robotic reinforcement learning at scale

D Kalashnikov, J Varley, Y Chebotar… - arXiv preprint arXiv …, 2021 - arxiv.org

General-purpose robotic systems must master a large repertoire of diverse skills to be useful
in a range of daily tasks. While reinforcement learning provides a powerful framework for …

被引用次数：141 相关文章所有 2 个版本

[PDF] thecvf.com

Greedy hierarchical variational autoencoders for large-scale video prediction

B Wu, S Nair, R Martin-Martin… - Proceedings of the …, 2021 - openaccess.thecvf.com

A video prediction model that generalizes to diverse scenes would enable intelligent agents
such as robots to perform a variety of tasks via planning with the model. However, while …

被引用次数：114 相关文章所有 5 个版本

[PDF] mlr.press

How to leverage unlabeled data in offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - International …, 2022 - proceedings.mlr.press

Offline reinforcement learning (RL) can learn control policies from static datasets but, like
standard RL methods, it requires reward annotations for every transition. In many cases …

被引用次数：59 相关文章所有 5 个版本

[PDF] arxiv.org

Parrot: Data-driven behavioral priors for reinforcement learning

A Singh, H Liu, G Zhou, A Yu, N Rhinehart… - arXiv preprint arXiv …, 2020 - arxiv.org

Reinforcement learning provides a general framework for flexible decision making and
control, but requires extensive data collection for each new task that an agent needs to …

被引用次数：134 相关文章所有 3 个版本

[PDF] neurips.cc

Conservative data sharing for multi-task offline reinforcement learning

T Yu, A Kumar, Y Chebotar… - Advances in …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) algorithms have shown promising results in domains
where abundant pre-collected data is available. However, prior methods focus on solving …

被引用次数：75 相关文章所有 6 个版本

高级搜索

QQ 群