Q-learning with experience replay in a dynamic environment

M Pieters, MA Wiering - 2016 IEEE Symposium Series on …, 2016 - ieeexplore.ieee.org
Most research in reinforcement learning has focused on stationary environments. In this
paper, we propose several adaptations of Q-learning for a dynamic environment, for both …

DQL: A new updating strategy for reinforcement learning based on Q-learning

CE Mariano, EF Morales - Machine Learning: ECML 2001: 12th European …, 2001 - Springer
In reinforcement learning an autonomous agent learns an optimal policy while interacting
with the environment. In particular, in one-step Q-learning, with each action an agent …

Improving the performance of q-learning using simultanouse q-values updating

M Pouyan, A Mousavi, S Golzari… - … and Knowledge (ICTCK …, 2014 - ieeexplore.ieee.org
Q-learning is a one of the best model-free reinforcement learning algorithms. The goal is to
find an estimate of the optimal action-value function called Q-value function. The Q-value …

Interaction models for multiagent reinforcement learning

R Ribeiro, AP Borges… - … Intelligence for Modelling …, 2008 - ieeexplore.ieee.org
This article proposes and compares different interaction models for reinforcement learning
based on multi-agent system. The cooperation during the learning process is crucial to …

Reinforcement learning in R

N Pröllochs, S Feuerriegel - arXiv preprint arXiv:1810.00240, 2018 - arxiv.org
Reinforcement learning refers to a group of methods from artificial intelligence where an
agent performs learning through trial and error. It differs from supervised learning, since …

An experience replay method based on tree structure for reinforcement learning

WC Jiang, KS Hwang, JL Lin - IEEE Transactions on Emerging …, 2019 - ieeexplore.ieee.org
Q-Learning, which is a well-known model-free reinforcement learning algorithm, a learning
agent explores an environment to update a state-action function. In reinforcement learning …

[PDF][PDF] Addressing the policy-bias of q-learning by repeating updates

S Abdallah, M Kaisers - … of the 2013 international conference on …, 2013 - ifaamas.org
ABSTRACT Q-learning is a very popular reinforcement learning algorithm being proven to
converge to optimal policies in Markov decision processes. However, Q-learning shows …

A distributed q-learning algorithm for multi-agent team coordination

J Huang, B Yang, D Liu - 2005 International Conference on …, 2005 - ieeexplore.ieee.org
Q-learning is an effective model-free reinforcement learning algorithm. However, Q-learning
is centralized and competent only for single agent learning but not multi-agent learning …

[PDF][PDF] An analysis of Q-learning algorithms with strategies of reward function

S Manju, M Punithavalli - International Journal on Computer Science and …, 2011 - Citeseer
Q-Learning is a Reinforcement Learning technique that works by learning an action-value
function that gives the expected utility of taking a given action in a given state and following …

Regional cooperative multi-agent q-learning based on potential field

L Liu, L Li - 2008 Fourth International Conference on Natural …, 2008 - ieeexplore.ieee.org
More and more artificial intelligence researchers focused on the reinforcement learning (RL)-
based multi-agent system (MAS). Multi-agent learning problems can in principle be solved …