查看文章

Hierarchical Reinforcement Learning.

作者

Carlos Diuk, Michael L Littman

发表日期

2009

图书

Encyclopedia of Artificial Intelligence

页码范围

825-830

简介

Reinforcement learning (RL) deals with the problem of an agent that has to learn how to behave to maximize its utility by its interactions with an environment (Sutton & Barto, 1998; Kaelbling, Littman & Moore, 1996). Reinforcement learning problems are usually formalized as Markov Decision Processes (MDP), which consist of a finite set of states and a finite number of possible actions that the agent can perform. At any given point in time, the agent is in a certain state and picks an action. It can then observe the new state this action leads to, and receives a reward signal. The goal of the agent is to maximize its long-term reward. In this standard formalization, no particular structure or relationship between states is assumed. However, learning in environments with extremely large state spaces is infeasible without some form of generalization. Exploiting the underlying structure of a problem can effect generalization and …

引用总数

被引用次数：5

2010201120122013201420152016201720182019202020212022202320241 1 1 1 1

学术搜索中的文章

Hierarchical reinforcement learning

C Diuk, M Littman - Encyclopedia of artificial intelligence, 2009

被引用次数：5 相关文章