J Murata - Robotics, automation and control, book, 2008 - books.google.com
Reinforcement learning (Kaelbling et al., 1996; Sutton & Barto, 1998) is a machine learning
technique that automatically acquires a good action policy, ie a mapping from the current …