作者
Carlos Diuk, Andre Cohen, Michael L Littman
发表日期
2008/7/5
研讨会论文
Proceedings of the 25th international conference on Machine learning
页码范围
240-247
出版商
ACM
简介
Rich representations in reinforcement learning have been studied for the purpose of enabling generalization and making learning feasible in large state spaces. We introduce Object-Oriented MDPs (OO-MDPs), a representation based on objects and their interactions, which is a natural way of modeling environments and offers important generalization opportunities. We introduce a learning algorithm for deterministic OO-MDPs and prove a polynomial bound on its sample complexity. We illustrate the performance gains of our representation and algorithm in the well-known Taxi domain, plus a real-life videogame.
引用总数
2008200920102011201220132014201520162017201820192020202120222023202411151018101320122440335147413215
学术搜索中的文章
C Diuk, A Cohen, ML Littman - Proceedings of the 25th international conference on …, 2008