Incremental development of complex behaviors through automatic construction of sensory-motor...

J Schmidhuber - 2015 - modl.sites.umassd.edu

In recent years, deep artificial neural networks (including recurrent ones) have won
numerous contests in pattern recognition and machine learning. This historical survey …

被引用次数：24047 相关文章

[PDF] sciencedirect.com

Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

RS Sutton, D Precup, S Singh - Artificial intelligence, 1999 - Elsevier

Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key, longstanding challenges for AI. In this paper we consider how these challenges can …

被引用次数：4701 相关文章所有 39 个版本

[PDF] psu.edu

Formal theory of creativity, fun, and intrinsic motivation (1990–2010)

J Schmidhuber - IEEE transactions on autonomous mental …, 2010 - ieeexplore.ieee.org

The simple, but general formal theory of fun and intrinsic motivation and creativity (1990-
2010) is based on the concept of maximizing intrinsic reward for the active creation or …

被引用次数：1086 相关文章所有 11 个版本

[PDF] tum.de

Learning complex, extended sequences using the principle of history compression

J Schmidhuber - Neural computation, 1992 - ieeexplore.ieee.org

Previous neural network learning algorithms for sequence processing are computationally
expensive and perform poorly when it comes to long time lags. This paper first introduces a …

被引用次数：680 相关文章所有 9 个版本

[PDF] psu.edu

[图书][B] Continual learning in reinforcement environments

MB Ring - 1994 - search.proquest.com

Continual learning is the constant development of complex behaviors with no final end in
mind. It is the process of learning ever more complicated skills by building on those skills …

被引用次数：396 相关文章所有 8 个版本

[PDF] researchgate.net

[PDF][PDF] Neural net architectures for temporal sequence processing

MC Mozer - Santa Fe Institute Studies in the Sciences of …, 1993 - researchgate.net

I present a general taxonomy of neural net architectures for processing time-varying
patterns. This taxonomy subsumes many existing architectures in the literature, and points to …

被引用次数：455 相关文章所有 9 个版本

[PDF] arxiv.org

On learning to think: Algorithmic information theory for novel combinations of reinforcement learning controllers and recurrent neural world models

J Schmidhuber - arXiv preprint arXiv:1511.09249, 2015 - arxiv.org

This paper addresses the general problem of reinforcement learning (RL) in partially
observable environments. In 2013, our large RL recurrent neural networks (RNNs) learned …

被引用次数：134 相关文章所有 2 个版本

[PDF] psu.edu

TD models: Modeling the world at a mixture of time scales

RS Sutton - Machine Learning Proceedings 1995, 1995 - Elsevier

Temporal-difference (TD) learning can be used not just to predict rewards, as is commonly
done in reinforcement learning, but also to predict states, ie, to learn a model of the world's …

被引用次数：224 相关文章所有 10 个版本

[PDF] stanford.edu

[图书][B] Learning action models for reactive autonomous agents

SS Benson - 1997 - search.proquest.com

To be maximally effective, autonomous agents such as robots must be able both to react
appropriately in dynamic environments and to plan new courses of action in novel situations …

被引用次数：119 相关文章所有 13 个版本

[PDF] umass.edu

Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales

RS Sutton - 1998 - scholarworks.umass.edu

Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key challenges for AI. In this paper we develop an approach to these problems based on …

被引用次数：125 相关文章所有 22 个版本

高级搜索

QQ 群