查看文章

arxiv.org 中的 [PDF]

Reasoning and generalization in rl: A tool use perspective

作者

Sam Wenke, Dan Saunders, Mike Qiu, Jim Fleming

发表日期

2019/7/3

期刊

arXiv preprint arXiv:1907.02050

简介

Learning to use tools to solve a variety of tasks is an innate ability of humans and has been observed of animals in the wild. However, the underlying mechanisms that are required to learn to use tools are abstract and widely contested in the literature. In this paper, we study tool use in the context of reinforcement learning and propose a framework for analyzing generalization inspired by a classic study of tool using behavior, the trap-tube task. Recently, it has become common in reinforcement learning to measure generalization performance on a single test set of environments. We instead propose transfers that produce multiple test sets that are used to measure specified types of generalization, inspired by abilities demonstrated by animal and human tool users. The source code to reproduce our experiments is publicly available at https://github.com/fomorians/gym_tool_use.

引用总数

被引用次数：7

201920202021202220231 3 1 2

学术搜索中的文章

Reasoning and generalization in rl: A tool use perspective

S Wenke, D Saunders, M Qiu, J Fleming - arXiv preprint arXiv:1907.02050, 2019

被引用次数：7 相关文章所有 2 个版本