[图书][B] Reinforcement learning and optimal control

D Bertsekas - 2019 - books.google.com
This book considers large and challenging multistage decision problems, which can be
solved in principle by dynamic programming (DP), but their exact solution is computationally …

[图书][B] Rollout, policy iteration, and distributed reinforcement learning

D Bertsekas - 2021 - books.google.com
The purpose of this book is to develop in greater depth some of the methods from the
author's Reinforcement Learning and Optimal Control recently published textbook (Athena …

[图书][B] Reinforcement learning and dynamic programming using function approximators

L Busoniu, R Babuska, B De Schutter, D Ernst - 2017 - taylorfrancis.com
From household appliances to applications in robotics, engineered systems involving
complex dynamics can only be as effective as the algorithms that control them. While …

[图书][B] Algorithms for reinforcement learning

C Szepesvári - 2022 - books.google.com
Reinforcement learning is a learning paradigm concerned with learning to control a system
so as to maximize a numerical performance measure that expresses a long-term objective …

[图书][B] Abstract dynamic programming

D Bertsekas - 2022 - books.google.com
This is the 3rd edition of a research monograph providing a synthesis of old research on the
foundations of dynamic programming (DP), with the modern theory of approximate DP and …

[图书][B] Dynamic programming and optimal control: Volume I

D Bertsekas - 2012 - books.google.com
This is the leading and most up-to-date textbook on the far-ranging algorithmic
methododogy of Dynamic Programming, which can be used for optimal control, Markovian …

Convergence results for single-step on-policy reinforcement-learning algorithms

S Singh, T Jaakkola, ML Littman, C Szepesvári - Machine learning, 2000 - Springer
An important application of reinforcement learning (RL) is to finite-state control problems and
one of the most difficult problems in learning for control is balancing the exploration …

[图书][B] Lessons from AlphaZero for optimal, model predictive, and adaptive control

D Bertsekas - 2022 - books.google.com
The purpose of this book is to propose and develop a new conceptual framework for
approximate Dynamic Programming (DP) and Reinforcement Learning (RL). This framework …

Reinforcement Learning: An Introduction. By Richard's Sutton

AG Barto - SIAM Rev, 2021 - SIAM
Reinforcement learning (RL) is a set of mathematical methods and algorithms that can be
applied to a wide array of problems and plays a central role in machine learning. The aim of …

Linearly-solvable Markov decision problems

E Todorov - Advances in neural information processing …, 2006 - proceedings.neurips.cc
We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have
discrete state spaces and continuous control spaces. The controls have the effect of …