[PDF][PDF] Deep learning in neural networks: An overview

J Schmidhuber - 2015 - modl.sites.umassd.edu
In recent years, deep artificial neural networks (including recurrent ones) have won
numerous contests in pattern recognition and machine learning. This historical survey …

Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

RS Sutton, D Precup, S Singh - Artificial intelligence, 1999 - Elsevier
Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key, longstanding challenges for AI. In this paper we consider how these challenges can …

Formal theory of creativity, fun, and intrinsic motivation (1990–2010)

J Schmidhuber - IEEE transactions on autonomous mental …, 2010 - ieeexplore.ieee.org
The simple, but general formal theory of fun and intrinsic motivation and creativity (1990-
2010) is based on the concept of maximizing intrinsic reward for the active creation or …

Learning complex, extended sequences using the principle of history compression

J Schmidhuber - Neural computation, 1992 - ieeexplore.ieee.org
Previous neural network learning algorithms for sequence processing are computationally
expensive and perform poorly when it comes to long time lags. This paper first introduces a …

[图书][B] Continual learning in reinforcement environments

MB Ring - 1994 - search.proquest.com
Continual learning is the constant development of complex behaviors with no final end in
mind. It is the process of learning ever more complicated skills by building on those skills …

[PDF][PDF] Neural net architectures for temporal sequence processing

MC Mozer - Santa Fe Institute Studies in the Sciences of …, 1993 - researchgate.net
I present a general taxonomy of neural net architectures for processing time-varying
patterns. This taxonomy subsumes many existing architectures in the literature, and points to …

On learning to think: Algorithmic information theory for novel combinations of reinforcement learning controllers and recurrent neural world models

J Schmidhuber - arXiv preprint arXiv:1511.09249, 2015 - arxiv.org
This paper addresses the general problem of reinforcement learning (RL) in partially
observable environments. In 2013, our large RL recurrent neural networks (RNNs) learned …

TD models: Modeling the world at a mixture of time scales

RS Sutton - Machine Learning Proceedings 1995, 1995 - Elsevier
Temporal-difference (TD) learning can be used not just to predict rewards, as is commonly
done in reinforcement learning, but also to predict states, ie, to learn a model of the world's …

[图书][B] Learning action models for reactive autonomous agents

SS Benson - 1997 - search.proquest.com
To be maximally effective, autonomous agents such as robots must be able both to react
appropriately in dynamic environments and to plan new courses of action in novel situations …

Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales

RS Sutton - 1998 - scholarworks.umass.edu
Learning, planning, and representing knowledge at multiple levels of temporal abstraction
are key challenges for AI. In this paper we develop an approach to these problems based on …