[HTML][HTML] Exploring deep reinforcement learning with multi q-learning

E Duryea, M Ganger, W Hu - Intelligent Control and Automation, 2016 - scirp.org
Q-learning is a popular temporal-difference reinforcement learning algorithm which often
explicitly stores state values using lookup tables. This implementation has been proven to …

[HTML][HTML] Exploring Deep Reinforcement Learning with Multi Q-Learning

E Duryea, M Ganger, W Hu - Intelligent Control and Automation, 2016 - scirp.org
Q-learning is a popular temporal-difference reinforcement learning algorithm which often
explicitly stores state values using lookup tables. This implementation has been proven to …

[PDF][PDF] Exploring Deep Reinforcement Learning with Multi Q-Learning

E Duryea, M Ganger, W Hu - 2016 - scholar.archive.org
Q-learning is a popular temporal-difference reinforcement learning algorithm which often
explicitly stores state values using lookup tables. This implementation has been proven to …

[PDF][PDF] Exploring Deep Reinforcement Learning with Multi Q-Learning

E Duryea, M Ganger, W Hu - 2016 - pdfs.semanticscholar.org
Q-learning is a popular temporal-difference reinforcement learning algorithm which often
explicitly stores state values using lookup tables. This implementation has been proven to …

[PDF][PDF] Exploring Deep Reinforcement Learning with Multi Q-Learning

E Duryea, M Ganger, W Hu - 2016 - file.scirp.org
Q-learning is a popular temporal-difference reinforcement learning algorithm which often
explicitly stores state values using lookup tables. This implementation has been proven to …