Exploring compact reinforcement-learning representations with linear regression

Authors

Thomas J Walsh, István Szita, Carlos Diuk, Michael L Littman

Publication date

2009/6/18

Conference

Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence

Pages

591-598

Publisher

AUAI Press

Description

This paper presents a new algorithm for online linear regression whose efficiency guarantees satisfy the requirements of the KWIK (Knows What It Knows) framework. The algorithm improves on the complexity bounds of the current state-of-the-art procedure in this setting. We explore several applications of this algorithm for learning compact reinforcement-learning representations. We show that KWIK linear regression can be used to learn the reward function of a factored MDP and the probabilities of action outcomes in Stochastic STRIPS and Object Oriented MDPs, none of which have been proven to be efficiently learnable in the RL setting before. We also combine KWIK linear regression with other KWIK learners to learn larger portions of these models, including experiments on learning factored MDP transition and reward functions together.

Total citations

Cited by 133

Citations per year: 2011: 9, 2012: 13, 2013: 11, 2014: 7, 2015: 6, 2016: 8, 2017: 13, 2018: 8, 2019: 8, 2020: 7, 2021: 10, 2022: 6, 2023: 11, 2024: 4

Scholar articles

Exploring compact reinforcement-learning representations with linear regression

TJ Walsh, I Szita, C Diuk, ML Littman - arXiv preprint arXiv:1205.2606, 2012
